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We review the data gathering and analysis procedure used in real Einstein-Podolsky-Rosen-Bohm 
experiments with photons and we illustrate the procedure by analyzing experimental data. Based 
on this analysis, we construct event-based computer simulation models in which every essential 
element in the experiment has a counterpart. The data is analyzed by counting single-particle 
events and two-particle coincidences, using the same procedure as in experiments. The simulation 
models strictly satisfy Einstein's criteria of local causality, do not rely on any concept of quantum 
theory or probability theory, and reproduce the results of quantum theory for a quantum system of 
two 5 = 1/2 particles. We present a rigorous analytical treatment of these models and show that 
they may yield results that are in exact agreement with quantum theory. The apparent conflict 
with the folklore on Bell's theorem, stating that such models are not supposed to exist, is resolved. 
Finally, starting from the principles of probable inference, we derive the probability distributions 
of quantum theory of the Einstein-Podolsky-Rosen-Bohm experiment without invoking concepts of 
quantum theory. 
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I. INTRODUCTION 

As nanofabrication technology is advancing from the 
stage of scientific experiments to the stage of building 
nanoscopic systems that perform useful tasks, it is im- 
portant to have computational tools that allow the de- 
signer to assess, with adequate reliability, how the system 
will behave [ij. Quantum theory provides the founda- 
tion for developing these tools. However, just like any 
other theory, quantum theory has its own limitations. If 
the successful operation of the device depends on indi- 
vidual events rather than on the statistical properties of 
many events, quantum theory can no longer be used to 
describe the behavior of the device. Indeed, as is well- 
known from the early days in the development of quan- 
tum theory, quantum theory has nothing to say about 
individual events [1, [1, [3|- Reconciling the mathemat- 
ical formalism that does not describe individual events 
with the experimental fact that each observation yields a 
definite outcome is referred to as the quantum measure- 
ment paradox and is the most fundamental problem in 
the foundation of quantum theory [J . 

Computer simulation is widely regarded as comple- 
mentary to theory and experiment . If computer sim- 
ulation is indeed a third methodology, it should be pos- 
sible to simulate quantum phenomena on an event-by- 
event basis. In view of the fundamental problem alluded 
to above, there is little hope that we can find a simula- 
tion algorithm within the framework of quantum theory. 
However, if we think of quantum theory as a recipe to 
compute probability distributions only, there is nothing 
that prevents us from stepping outside the framework 
that quantum theory provides. 

To head off possible misunderstandings, it may be im- 
portant to rephrase what has been said. Of course, we 
could simply use pseudo-random numbers to generate 
events according to the probability distribution that is 
obtained by solving the time-independent Schrodinger 



equation. However, that is not what we mean when we 
say that within the framework of quantum theory, there 
is little hope to find an algorithm that simulates the in- 
dividual events and reproduces the expectation values 
obtained from quantum theory. The challenge is to find 
algorithms that simulate, event-by-event, the experimen- 
tal observations that, for instance, interference patterns 
appear only after a considerable number of individual 
events have been recorded by the detector [3] , without 
first solving the Schrodinger equation. 

In a number of recent papers [1, [^ [13, [lH [13, [l^ - 
we have demonstrated that locally-connected networks of 
processing units with a primitive learning capability can 
simulate event-by-event, the single-photon beam splitter 
and Mach-Zehnder interferometer experiments of Grang- 
ier et al. Furthermore, we have shown that this ap- 
proach can be generalized to simulate universal quantum 
computation by an event-by-event process [3, Il2l . [isj . 
Therefore, at least in principle, our approach can be used 
to simulate all wave interference phenomena and many- 
body quantum systems using particle-like processes only. 
This work suggests that we may have discovered a pro- 
cedure to simulate quantum phenomena using causal, 
Einstein-local, event-based processes. Our approach is 
not an extension of quantum theory in any sense nor is 
it a proposal for another interpretation of quantum me- 
chanics. The probability distributions of quantum theory 
are generated by local, causal processes. 

According to the folklore about Bell's theorem, a pro- 
cedure such as the one that we discovered should not 
exist. Bell's theorem states that any local, hidden vari- 
able model will produce results that are in conflict with 
the quantum theory of a system of two S' = 1/2 parti- 
cles [Tj]. However, it is often overlooked that this state- 
ment can be proven for a (very) restricted class of prob- 
abilistic models only. Indeed, minor modifications to the 
original model of Bell lead to the conclusion that there 
is no conflict [l^, [H, [13 • In i&ct, Bell's theorem does 
not necessarily apply to the systems that we are inter- 
ested in as both simulation algorithms and actual data do 
not need to satisfy the (hidden) conditions under which 
Bell's theorem hold [13, [H, [lol- Furthermore, we have 
given analytical proofs that two-particle correlations of 
the simulation models ag ree exactly with the quantum 
theoretical expression [2l[, [23] ■ 



A. Aim of this work 

In this paper, we take the point of view that the fun- 
damental problem, originating from the work of Ein- 
stein, Podolsky, and Rosen (EPR) [13], reformulated by 
Bohm j3|] and studied in detail by Bell [ij], is to explain 
how individual events, registered by different detectors 
in such a way that a measurement on one particle does 
not have a causal effect on the result of the measurement 
on the other particle (Einstein's criterion of local causal- 
ity), exhibit the correlations that are characteristic for a 
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FIG. 1: Logical relationship between data and theory. 



quantum system in the entangled state. We assume that: 

• The experimental data, including the (post) pro- 
cessing of it, constitutes the set of facts 



• The experimental facts are a faithful representation 
of the results of the ideal experiment 



• Quantum theory is compatible with these facts. In 
the quantum physics community, it is generally ac- 
cepted that the results of Einstein-Podolsky- Rosen- 
Bohm (EPRB) experiments agree with the predic- 
tions of quantum theory [2|, [2^, (23, H, S S 
MM 



In this paper, we review constructive proofs that there 
exist (simple) computer simulation algorithms that sat- 
isfy Einstein's criterion of local causality and exactly re- 
produce the results of the quantum theoretical descrip- 
tion of real EPRB experiments [HI, [H, [H [13, HI . These 
algorithms generate the same type of data as experiments 
and employ the same procedure as used in experiments to 
analyze the data. In view of the quantum measurement 
paradox |2|, |4| , the latter prohibits the use of algorithms 
that rely on (concepts of) quantum theory. In addition, 
for the reasons explained later, these simulation algo- 
rithms do not rely on techniques of inductive inference 
(probability theory) to draw conclusions from the data. 
In this paper, we also discuss the apparent conflict with 
Bell's theorem. 

To appreciate the fundamental issues that are involved, 
it is necessary to understand well the logical relation be- 
tween computer simulation, experiment and theory on 
the one hand and data and theory on the other hand. 
Therefore, we first elaborate on these relationships. 



B. Computer simulation versus experiment and 
theory 

In general, and in the analysis of real EPRB experi- 
ments [3, [23} in particular, it is important to recognize 
that there are fundamental, conceptual differences be- 
tween the set of experimental facts, their interpretation 
in terms of a mathematical model, and a computer sim- 
ulation of the facts. 

Obviously, because of limited precision of the instru- 
ments, any record of experimental facts is just a set of 
integer numbers (floating point numbers have a finite 
number of digits and can therefore be regarded as integer 
numbers). Theories that describe Newtonian mechanics 
or electrodynamics assign real numbers to experimentally 
observable quantities. The relation between theory and 
experimental data is one-to-one: The experimental accu- 
racy determines the number of significant digits of the 
real numbers. These theories have a deductive character. 

Quantum theory assigns a probability, a real number 
between zero and one, for an event (= experimental fact) 
to occur [3, [3, [3^. However, we can always use an in- 
teger number to represent the event itself (in any real 
experiment the number of events is necessary finite). By 
assigning probabilities to events, we change the character 
of the theoretical description on a fundamental level: In- 
stead of deduction, we (have to) use inductive inference 
to relate a theoretical description to the facts 2, 3^. 

Although probability theory provides a rigorous math- 
ematical framework to make such inferences, there are 
ample examples that illustrate how easy it is to make 
the wron g iri ference, also for mundane, every-day prob- 
lems [37l. l38l I39I lioj that are not related to quantum 
mechanics at all. Subtle mistakes such as dropping some 
of the conditions [4l| , or mixing up the meaning of phys- 
ical independence and logical independence, can give rise 
to all kinds of paradoxes [H, [H, [S [S [S, [S [S, [i^ • 

In general, a computer simulation approach does not 
need the machinery of probability theory to relate sim- 
ulation data to the experimental facts. A digital com- 
puter can generate sets of integer numbers only. We 
can compare these numbers to the experimental data di- 
rectly, without recourse to inductive inference. On the 
one hand, this puts computer simulation in the luxury 
position that it cannot suffer from mistakes of the kind 
alluded to earlier, simply because there is no need to use 
inductive inference. On the other hand, using the com- 
puter, we are strictly bound to the elementary rules of 
logic and arithmetic. Therefore, it is not legitimate to 
use arguments such as "in an experiment it is impossible 
to repeat the experiment twice and get exactly the same 
answer" . While this statement is correct with very high 
probability, when we use a digital computer it is logically 
false because we can always exactly repeat the same cal- 
culation (we exclude the possibility that the computer is 
malfunctioning). Therefore, in a computer simulation, it 
should be possible to explain the facts without invoking 
"loopholes" such as detection efficiency or counterfactual 
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reasoning. 

A graphical representation of the point of view taken 
in this paper is given in Fig. [1] On the left, we have pro- 
cesses that generate events. Each event is represented by 
one or more numbers, which we call raw data. Experience 
or a new idea provide inspiration to choose one or more 
methods to analyze the data. Typically, this analysis 
maps the raw data onto a few numbers (called averages 
and coincidence counts in Fig. [1]), that is the raw data 
is being compressed. On the right hand side, we have 
several candidate mathematical models, "theories" , that 
may "explain" the results of the data analysis. 

But, how do we relate data to (quantum) theory? It 
is essential to recognize that before we can address this 
question, we have to make the hypothesis that there ex- 
ists some process that gives rise to the observed data. 
Otherwise, we cannot go beyond the description of merely 
giving the data as it is. Furthermore, a useful theoret- 
ical model should give a description of the data that is 
considerably more compact than the data itself. 

Crossing the line that separates the model space from 
the data space requires making the fundamental hypoth- 
esis that the process that gives rise to the data can be de- 
scribed within the framework of probability theory. Only 
then, we are in the position that we can use probability 
theory to relate the mathematical model to the observed 
frequencies. Of course, this is consistent with the fact 
that quantum theory does not describe the individual 
events themselves Q . 

In this paper, the rules of probability theory are mainly 
used as a tool to reason in a logically consistent man- 
ner [33, 1 to make logical inferences about the frequen- 
cies that we can compute from the observed data [37, 40|. 
These inferences concern logical relations which may or 
may not correspond to causal physical influences [37|. 
As we will see later, much of the mysticism surround- 
ing Bell's theorem can be traced back to the failure to 
recognize that probability theory is not defined through 
frequencies. 

To avoid misunderstandings of what we are aiming to 
accomplish here, it may be useful to draw an analogy 
with methods for simulating classical statistical mechan- 
ics • According to the theory of equilibrium statistical 
mechanics, the probability that a system is in the state 
with label n is given by 



where N is the number of different states of the sys- 
tem, which usually is very large. En is the energy of 
the state, and p = l/ksT where ks is Boltzmann's 
constant and T is the temperature. Disregarding ex- 
ceptional cases such as the two-dimensional Ising model, 
for a nontrivial many-body system the partition function 
Z ~ e"''^" is unknown. Hence, p„ is not known. 

Can we construct a simulation algorithm that gener- 
ates states according to the unknown probability distri- 



bution (pi, . . . ,pAr)? An affirmative answer to this ques- 
tion was given by Metropolis et al. [1, IH, |4^ . The ba- 
sic idea is to design an artificial dynamical system, a 
Markov chain or master equation that samples the space 
of N states such that in the long run, the frequency with 
which this system visits the state n approaches Pn with 
probability one 

Looking back at Fig. [1] if we replace "event-by-event 
simulation algorithm(s)" by "Metropolis Monte Carlo 
Method", "Average ... counts" by "Average energy ...", 
and "Quantum theory" by "Equilibrium Statistical Me- 
chanics" , the status of simulation algorithms and theo- 
retical models in these two different fields of physics is 
the same. 

Although in applications to statistical mechanics, the 
Markov chain dynamics is of considerable interest in it- 
self, there obviously is no relation to the Newtonian dy- 
namics of the particles involved [5| . The same holds for 
the dynamical processes that reproduce the results of 
quantum theory: If an event-by-event simulation algo- 
rithm generates the same type of raw data as the ex- 
periment does and the data analysis yields results that 
agree with quantum theory we should be pleased with 
this achievement and not ask for this dynamics to be 
"unique". In fact, in our earlier work we have already 
shown that there exist both deterministic and pseudo- 
random processes that reproduce equally well the prob- 
ability distributions obtained from quantum theory and 
experiments [1, i, [l^, [H [H, [11 • 

C. Disclaimer 

The work reviewed here is not concerned with the in- 
terpretation or extension of quantum theory. The fact 
that there exist simulation algorithms that reproduce the 
results of quantum theory has no direct implications to 
the foundations of quantum theory: The algorithm de- 
scribes the process of generating events on a level of detail 
about which quantum theory has nothing to say (quan- 
tum measurement paradox) 0, Q • The average proper- 
ties of the data may be in perfect agreement with quan- 
tum theory but the algorithms that generate such data 
are outside of the scope of what quantum theory can de- 
scribe. This may sound a little strange but it is not if 
one recognizes that probability theory does not contain 
nor provides an algorithm to generate the values of the 
random variables either, which in a sense, is at the heart 
of the quantum measurement paradox. 



D. Structure of the paper 

The paper is organized as follows. In Section |TT1 we 
review the EPRB gedanken experiment with magnetic 
particles and its experimental realization using the pho- 
ton polarization as a two-state system. We elaborate on 
the data gathering and analysis procedures. An essential 
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ingredient of the data analysis procedure is the time win- 
dow that is used to identify coincidences. In constrast to 
textbook treatments of EPRB experiments in which the 
window is impUcitly assumed to be infinite, in real exper- 
iments the time window is made as small as possible. We 
illustrate the importance of the choice of the time win- 
dow by analyzing a data set of a real EPRB experiment 
with photons [32]. 

Section Hill briefly recalls the essentials of the quantum 
theoretical description of the EPRB experiment in terms 
of a system of two S = 1/2 particles. 

Section IIVI addresses the problem of relating quantum 
theory and real data. In Section HVAl we discuss how to 
generate individual events from the solution of the quan- 
tum theoretical problem and how to relate the quantum 
theoretical expectation values to the actual data. Sec- 
tion IIV CI deals with the inverse problem: How do we 
relate data to (quantum) theory? We elaborate on the 
fundamental difference between probabilities (quantities 
that appear in the mathematical theory) and frequencies 
(numbers obtained by counting events). 

SectionlVl introduces deterministic and pseudo-random 
event-based computer simulation models that satisfy Ein- 
stein's criteria of local causality and reproduce the results 
of the quantum theory of two 5=1/2 particles. We also 
prove that these models can exhibit correlations that are 
stronger than those obtained from the quantum theory 
of two 5=1/2 particles. 

In Sect ion rvTl we resolve the apparent conflict between 
the fact that there exist event-based simulation models 
that satisfy Einstein's criteria of local causality and re- 
produce the results of the quantum theory of two 5=1/2 
particles and the folklore about Bell's theorem, stating 
that such models are not supposed to exist. We show 
that Bell's extension of Einstein's concept of locality im- 
plicitly assumes that the absence of a causal influence 
implies logical independence [s^ . an assumption which, 
in general, leads to logical inconsistencies [H, [s^ . 

In Section IVIIl we use standard Kolmogorov proba- 
bility calculus to analyze the probabilistic version of our 
simulation models. We give a rigorous proof that these 
models can reproduce exactly the results of the quantum 
theory of two 5=1/2 particles. 

In Section IVIIIi we propose a principle to derive 
the probability distributions of quantum theory of the 
EPRB experiment by using the algebra of probable in- 
ference [33, 113] , that is the axioms of probability theory, 
without making recourse to quantum theory. Our con- 
clusions are summarized in Section IIXI 



II. EPRB EXPERIMENTS 
A. Spin 1/2 particles 

Many experimental realizations and quantum theoret- 
ical descriptions of the EPR gedanken experiment psj 
adopt the model proposed by Bohm f3|. A schematic di- 
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FIG. 2: Schematic diagram of an EPRB experiment with 
magnetic particles [^]. 

agram of the EPRB experiment is shown in Fig. [21 A 
source emits charge-neutral pairs of particles with op- 
posite magnetic moments. The two particles separate 
spatially and propagate in free space to an observation 
station in which they are detected. As the particle ar- 
rives at station i = 1,2, it passes through a Stern-Gerlach 
magnet [soj . The magnetic moment of a particle inter- 
acts with the inhomogeneous magnetic field of a Stern- 
Gerlach magnet. The Stern-Gerlach magnet deflects the 
particle, depending on the orientation of the magnet and 
the magnetic moment of the particle. The Stern-Gerlach 
magnet divides the beam of particles in two, spatially 
well-separated parts [SOj . The observation that the beam 
splits into two, and not in a continuum of beams is in- 
terpreted as evidence that the particles carry a magnetic 
moment that can take two discrete values; it is quan- 
tized [13] ■ In quantum theory, we describe such a mag- 
netic moment using 5 = 1/2 operators. By changing 
the orientation of the Stern-Gerlach magnet, we change 
the direction of the plane that divides the two beams of 
particles. In quantum theory language, we say that the 
quantization axis is determined by the orientation of the 
Stern-Gerlach magnet. As the particle leaves the Stern- 
Gerlach magnet, it generates a signal in one of the two 
detectors. The firing of a detector corresponds to a de- 
tection event. 

Charge-neutral, magnetic particles that pass through 
a Stern-Gerlach magnet not only change their direction 
of motion but also experience a time-delay, depending on 
the direction of their magnetic moment, relative to the 
direction of the field in the Stern-Gerlach magnet. The 
time-delays in Stern-Gerlach magnets are used to per- 
form spectroscopy of atomic size magnetic clusters [Slj 
and atomic interferometry (s^ . 

Real experiments require a criterion to decide which 
events, registered in stations 1 and 2, correspond to the 
detection of particles belonging to a pair (a single two- 
particle system). In EPRB experiments, this criterion is 
the coincidence in time of the events [29l . [3^ . Issj , as is 
most clearly illustrated by the EPRB experiments that 
use the photon polarization as a two-state system [U [2^, 
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B. Photon polarization 

In Fig. [21 we show a schematic diagram of an EPRB 
experiment with photons (see also Fig. 2 in [s^). Here, 
a source emits pairs of photons with opposite polariza- 
tion. Each photon of a pair propagates to an observation 
station in which it is manipulated and detected. The 
two stations are separated spatially and temporally [33 |. 
This arrangement prevents the observation at station 1 
(2) to have a causal effect on the data registered at sta- 
tion 2 (1) |22|]. As the photon arrives at station i — 1,2, 
it passes through an electro-optic modulator that rotates 
the polarization of the photon by an angle depending on 
the voltage applied to the modulator. These voltages 
are controlled by two independent binary random num- 
ber generators. As the photon leaves the polarizer, it 
generates a signal in one of the two detectors. The sta- 
tion's clock assigns a time-tag to each generated signal. 
Effectively, this procedure discretizes time in intervals of 
a width that is determined by the time-tag resolution 
r (3^ . In the experiment, the firing of a detector is re- 
garded as an event. 

As light is supposed to consist of non-interacting pho- 
tons, it is not unreasonable to assume that the individual 
photons experience a time delay as they pass through the 
electro-optic modulators or polarizers. Indeed, according 
to Maxwell's equation, in the optically anisotropic mate- 
rials used to fabricate these devices, plane waves with dif- 
ferent polarization propagate with different velocity and 
are refracted differently 54]. 

It is clear that, at least conceptually, the EPRB ex- 
periments with photons or massive 5 = 1/2 particles are 
very similar. 

C. Idealized experiments 

As it is one of the goals of this paper to demonstrate 
that it is possible to reproduce the results of quantum 
theory (which implicitly assumes idealized conditions) for 
the EPRB gedanken experiment by an event-based sim- 
ulation algorithm, it would be logically inconsistent to 
"recover" the results of the former by simulating nonideal 
experiments. Therefore, in this paper, we consider ideal 
experiments only, meaning that we assume that detectors 
operate with 100% efficiency, clocks remain synchronized 
forever, the "fair sampling" assumption is satisfied [5^, 
and so on. We assume that the two stations are separated 
spatially and temporally such that the manipulation and 
observation at station 1 (2) cannot have a causal effect on 
the data registered at station 2 (1). Furthermore, to real- 
ize the EPRB gedanken experiment on the computer, we 
assume that the orientation of each Stern-Gerlach mag- 
net or electro-optic modulator can be changed at will, at 
any time. Although these conditions are very difhcult to 




FIG. 3: Schematic diagram of an EPRB experiment with pho- 
tons m. 



satisfy in real experiments, they are trivially realized in 
computer experiments. 

D. Particle source 

In general, on logical grounds (without counterfactual 
reasoning), it is impossible to make a statement about 
the directions of the spin (or polarization) of particles 
emitted by the source unless we have performed an ex- 
periment to determine these directions. Of course, in a 
computer experiment we have perfect control and we can 
select any direction that we like. Conceptually, we should 
distinguish between two extreme cases. In the first case, 
we assume that we know nothing about the direction of 
the spin (or polarization). We mimic this situation by 
using pseudo-random numbers to select the directions. 
This is the case that is typical for an EPRB experiment 
and we will refer to it as Case I. In the second case, ref- 
ered to as Case II, we assume that we know that the 
directions of both spins (or polarizations) are fixed (but 
not necessarily the same). A simulation algorithm that 
aims to reproduce the results of quantum theory of two 
S = 1/2 particles should be able to reproduce these re- 
sults for both Case I and II, without any change to the 
simulation algorithm except for the part that simulates 
the source. 



E. Data gathered in an EPRB experiment 

Here and in the sequel, we use the EPRB experiment 
with 5=1/2 particles as the primary example. The case 
of EPRB experiments that use the photon polarization 
can be treated in exactly the same manner, replacing 
three-dimensional unit vectors by two-dimensional ones 
and so on. 

In the experiment, the firing of a detector is regarded 
as an event. At the nth event, the data recorded on 
a hard disk at station z = 1,2 consists of Xn^i = ±1, 
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specifying which of the two detectors fired, the time tag 
tn^i indicating the time at which a detector fired, and 
the unit vector a„.i that specifies the direction of the 
magnetic field in the Stern-Gerlach magnet. Hence, the 
set of data collected at station i = 1,2 during a run of N 
events may be written as 



n,i7 ^n.z I 



1, 



,N}. 



(2) 



In the (computer) experiment, the data {Ti, T2} may 
be analyzed long after the data has been collected [32 |. 
Coincidences are identified by comparing the time dif- 
ferences {ira.i — tn^2\n — 1, • ■ • , N} with a time window 
W [3^. Introducing the symbol to indicate that the 
sum has to be taken over all events that satisfy — SLn^i 
for i = 1,2, for each pair of directions ai and a.2 of 
the Stern-Gerlach magnets, the number of coincidences 
Cxy = Cxy{sii,a2) between detectors D^^i {x = ±1) at 



station 1 and detectors Dy^2 
given by 



(y = ±1) at station 2 is 



N 



C^y = 5^,.,^Jy,.^M^ ^\tnS-tr,a\), (3) 



where Q{t) is the Heaviside step function. We emphasize 
that we count all events that, according to the same cri- 
terion as the one employed in experiment, correspond to 
the detection of pairs. 

The average single-particle counts are defined by 



and 



£;i(ai,a2) = 



£'2(ai,a2) 



'^x,y=±l ^xy 



'}lx,y=±l V^xy 



(4) 



where the denominator is the sum of all coincidences. Ac- 
cording to standard terminology, the correlation between 
a; = ±1 and y — ±1 events is defined by [ssj 



p(ai,a2) 



^ - a; , y ^ ^ y 



^i.V ^'^!CV SZx.y y'-'^ 



Ylx,y ^xy 



^2 1 C^^.v 



y^C^ 



,yc. 



(5) 



The correlation p(ai,a2) is +1 ( — 1) in the case that x — 
y [x — —y) with certainty. If the values of x and y are 
independent, the correlation p(ai,a2) is zero, but the 
converse is not necessarily true. 

In the case of dichotomic variables x and y, the cor- 
relation p(ai,a2) is entirely determined by the average 
single-particle counts Eq. ^ and the two-particle aver- 



age 



£^(ai,a2) 



'^x,y Xy^xy 
'^x,y ^^y 

C-i--)- -l- C — — C-i — — C*— H 



(6) 



C++ + C__ C+_ c_+ 
For later use, it is expedient to introduce the function 

5(a,b,c,d) =i;(a,c)-i;(a,d) + £;(b,c)-|-i;(b,d), (7) 
and its maximum 



Smax = max S'(a, b, c,d). 

a,b,c,d 



(8) 



In general, the values for the average single- 
particle counts iJi(ai,a2) and £^2(^1, 3-2) the coinci- 
dences C2:y(ai,a2), the two-particle averages i?(ai,a2). 



5(a, b, c, d), and Smax not only depend on the directions 
ai and a2 but also on the time-tag resolution r and the 
time window W used to identify the coincidences. 



F. Role of the time window 

Most theoretical treatments of the EPRB experiment 
assume that the correlation, as measured in the experi- 
ment, is given by [l3| 



N 

Xy / J "x,Xn.l"y,Xn,21 



(9) 



which we obtain from Eq. ([3]) by taking the limit W — s- 
00. Although this limit defines a valid theoretical model, 
there is no reason why this model should have any bear- 
ing on the real experiments, in particular because exper- 
iments pay considerable attention to the choice of W . A 
rational argument that might justify taking this limit is 
the hypothesis that for ideal experiments, the value of 
W should not matter. However, in experiments a lot of 
effort is made to reduce (not increase) W [H, [s^ . 
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FIG. 4: Smax as a function of the time window W, computed 
from the data sets contained in the archives Alice.zip and 
Bob. zip that can be downloaded from Ref . [s^. Bullets (red): 
Data obtained by using the relative time shift A = 4 ns that 
maximizes the number of coincidences. Crosses (blue): Raw 
data (A = 0). Dashed line at 2y/2: Smax if the system is 
described by quantum theory (see Section [Ilip . Dashed line 
at 2: Smax if the system is described by the class of models 
introduced by Bell [3]. 



As we will see later, using our model it is relatively 
easy to reproduce the experimental facts and the results 
of quantum theory if we neglect contributions that are 
0{W^). Furthermore, keeping W arbitrary does not 
render the mathematics more complicated so there re- 
ally is no point of studying the simplified model defined 
by Eq. ([5]): We may always consider the limiting case 
W oo afterwards. 



G. Case study: Analysis of experimental EPRB 
data 

It is remarkable that all textbook treatments of the 
EPRB experiment assume that the experimental data 
is obtained by using Eq. This is definitely not the 
case [Hjllll. We illustrate the importance of the choice of 
the time window W by analyzing a data set (the archives 
Alice.zip and Bob. zip) of an EPRB experiment with pho- 
tons that is publically available fSTj. 

In the real experiment, the number of events detected 
at station 1 is unlikely to be the same as the number of 
events detected at station 2. In fact, the data sets of 
Ref. [13 show that station 1 (Alice.zip) recorded 388455 
events while station 2 (Bob. zip) recorded 302271 events. 
Furthermore, in the real EPRB experiment, there may be 
an unknown shift A (assumed to be constant during the 
experiment) between the times gathered at station 1 
and the times tn,2 recorded at station 2. Therefore, there 
is some extra ambiguity in matching the data of station 
1 to the data of station 2. 

A simple data processing procedure that resolves this 



ambiguity consists of two steps [5g|. First, we make a 
histogram of the time differences tn,i — tm,2 with a small 
but reasonable resolution (we used 0.5 ns). Then, we fix 
the value of the time-shift A by searching for the time 
difference for which the histogram reaches its maximum, 
that is we maximize the number of coincidences by a 
suitable choice of A. For the case at hand, we find A = 
4 ns. Finally, we compute the coincidences, the two- 
particle average, and Smax using the expressions given 
earlier. The average times between two detection events 
is 2.5 ms and 3.3 ms for Alice and Bob, respectively. The 
number of coincidences (with double counts removed) is 
13975 and 2899 for (A = 4 ns, = 2 ns) and (A = , 
W — 3 ns) respectively. 

In Figs, m and O we present the results for Smax as a 
function of the time window W. First, it is clear that 
Smax decreases significantly as W increases but it is also 
clear that as W 0, Smax is not very sensitive to the 
choice of W [s^. Second, the procedure of maximizing 
the coincidence count by varying A reduces the maxi- 
mum value of Smax from a value 2.89 that considerably 
exceeds the maximum for the quantum system {2^/2, see 
Section to a value 2.73 that violates the Bell inequal- 
ity and is less than the maximum for the quantum sys- 
tem. 

The fact that the "uncorrected" data (A = 0) violate 
the rigorous bound for the quantum system should not 
been taken as evidence that quantum theory is "wrong" : 
As we explain later, it merely indicates that the way in 
which the data of the two stations has been grouped in 
two-particle events is not optimal. Put more bluntly, 
there is no reason why a correlation between similar but 
otherwise unrelated data should be described by quan- 
tum theory. In any case, the analysis of the experimen- 
tal data shows beyond doubt that a model which aims to 
describe real EPRB experiments should include the time 
window W and that the interesting regime is 0, not 

— + cxD as is assumed in all textbook treatments of the 
EPRB experiment. In Sections fVl and IVlIl we show that 
our simulation models reproduce the salient features of 
Figs, m and [5] quite well if contributions that are 0{W^) 
can be neglected. 



III. QUANTUM THEORY 

In this section we briefly review some well-known re- 
sults for the quantum theory of a system of two = 1/2 
particles and we give a brief account of the quantum the- 
oretical description of Case I and Case II. In quantum 
theory, the state of a system of two S" = 1/2 objects is 
described by a 4 x 4 density matrix p (2]. The average 
value of a dynamical variable, represented by the 4x4 
matrix X is {X) = TrpX Q- According to the axioms 
of quantum theory 0, repeated measurements on the 
two-particle system described by the density matrix p 
yield statistical estimates for the single-particle expecta- 
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FIG. 5; Same as Fig. |4] except for the range of W. Bullets 
(red): Data obtained by using the relative time shift A = 4 ns 
that maximizes the number of coincidences. The maximum 
value of Smax ~ 2.73 is found &t W = 2 ns. Crosses (blue): 
Raw data A = 0. The maximum value of Smax ~ 2.89 is 
found at 14^ = 3 ns. 



tion values 

h{si) = {<7^■a), (10) 

for 1 = 1,2 and the two-particle correlations 

i?(a,b) = (cri -affz -b), (11) 

where ai — (erf , , erf ) are the Pauli spin- 1/2 matrices 
describing the spin of particle i = 1, 2 2], and a and b are 



unit vectors. We introduce the notation to distinguish 
the quantum theoretical results from the results obtained 
by analysis of the data {Ti, T2}. 

If the density matrix of the quantum system factor- 
izes, p = pi ® P2, where pi is the 2x2 density matrix 
of particle i. Then ^^(a, b) = Ei{a)E2(h) and the cor- 
relation p(ai,a2) = ^^(a, b) — £'i(a)£'2(b) — 0. Hence, 
p = Pi (8) y02 is called the uncorrelated quantum state. 

Let us denote a = i?i(a), b — i?i(b), and c — i?2(c), 
d — E2 (d) , such that S'(a, b, c, d) = ac — ad + hc + hd. 
Clearly, o, b,c,d E [—1, 1]. For any a, b,c,d E [—1, 1] we 
have [42| 

\ac — ad + be + bd\ < \ac — ad\ + \bc + bd\ 

< |a||c-d| + |6||c + d| 

< \c~d\ + \c + d\ 

< 1-cd+l + cd 

< 2. (12) 

Thus, we conclude that if the quantum system is in the 
uncorrelated state we must have 



Sraax = max 5(a, b, c, d) < 2. (13) 

a.b,c,d 



If the density matrix p does not factorize, the upper- 
bound to Smax can be found as follows Using the 
algebraic properties of the Pauli-spin matrices, a simple 
calculation yields, 



(cti • a (72 • c — (Ti • a (72 • d + (7i • b (72 • c + (Ji • b (72 • d)^ = 4 + 4(7i • (a X b) (72 • (c X d) 



(14) 



Noting that TrpX^Y defines an inner product on the vector space of 4 x 4 matrices X and Y, making use of the fact 
that p is positive semi-definite and that Trp = 1, we have 



\TrpX\^ = Trpi/2pi/2^ < Trip'^^)^ p'^^Tr{p'/^X)^ p'/^X = TrpX^ X. 

For X = X^ = CTi • a (72 • c — (7i • a (72 • d + (7i • b (72 • c -t- (7i • b (72 • d, Eq. becomes 

[E{a,c) - E{a,d) + E(h,c) + E{h,d)f < 4 + 4i?(a x b, c x d). 



(15) 



(16) 



As the eigenvalues of cri - a CT2 -b are ± a-b, and since a, b, 
c, and d are unit vectors, we have \E{a x b,c x d)| < 1. 
Hence H 

|5(a,b,c,d)| < 2V2, (17) 

independent of the quantum state p. According to 
Eqs. ini) and ((T71), if 2 < S^ax < 2\/2 the quantum sys- 



tem is in a correlated state, that is p ^ pi® P2- For pure 
states (Trp^ — 1), the converse is also true [5§| but, for 
general states p it is not [13, [61I, [12] ■ If, in an experiment 
or simulation, we would find that Smax > 2-\/2, the re- 
sults of this experiment or simulation cannot be described 
by the quantum theory of a system of two S* = 1/2 par- 
ticles. 
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We now examine the examples of a maximally corre- 
lated (entangled) quantum state (called Case I) and the 
uncorrelated quantum state (called Case II) in more de- 
tail. 



A. Case I: Singlet state 

The quantum theoretical description of the EPRB ex- 
periment assumes that the state of the two spin- 1/2 par- 
ticles is described by the singlet state p — where 
I*) = (I Ti) - I iT))/\/2 and I T) (I i)) is the eigen- 
state of cr^ with eigenvalue +1 (—1)- For the singlet 
state, the single-particle expectation values and the two- 
particle correlations are given by 

^,(a,;) - (*|a,;.a,|*) -0 ; ^ = l,2, (18) 



and 



£;(ai,a2) = (*|cri -ai 0-2 -aal*) = -ai -aa, (19) 



respectively. A simple calculation shows that Smax = 
2\/2, in other words, the singlet state satisfies Eq. (fl7|l 
with equality. 

For the singlet state, the probability P{x, y|ai, ^2) that 
we observe a pair of events x,y = ±1 under the (fixed) 
condition (ai , a2 ) is given by 



P(a;,y|ai,a2) 
from which it follows that 



1 - xyai • a2 



(20) 



P(a:|ai,a2) = ^ P(x, y|ai, a2) = 1/2, 



y=±l 



P(y|ai,a2) = ^ P(a:, y|ai, a2) = 1/2, (21) 



x=±l 



and 



^ P(a;,y|ai,a2) = 1, 
^ a;P(a;,y|ai,a2) = 0, 

it 1 

X! y^(2;,2/|ai,a2) = 0, 

±1 

^ a;?;P(a;,y|ai,a2) = -ai • a2, (22) 

in agreement with the second column of Table |T1 

In the quantum theoretical description, the state of 
the two spin-1/2 particles may be correlated (p(ai, a2) = 
i?(ai,a2)), even though the particles are spatially and 
temporally separated and do not necessarily interact. 



TABLE I: Quantum 


system of two S — 


1/2 obiects: 


The 


expectation values in 


the singlet state (C 


ase I) and in 


the 


product state (Case II). 








Case I 


Case II 




Si(ai) 





ai ■ Si 




|2(a2) 





a2 • S2 




-E(ai,a2) 


— ai • a2 


(ai ■ Si)(a2 ■ 


S2) 



B. Case II: Spin-polarized state 

In Case II, p = pi ® P2 where pj — \dj<j>j){6j(j)j\ and 
\ej(j)j) = cos(6»j/2)| t) + e^f"^ sin(6»j/2)| i) for j = 1,2. A 
straightforward calculation shows that 

E,{a,) = a, -S,; i = 1,2, (23) 
^(ai,a2) = ^i(ai)^2(a2), (24) 

where Si — (cos (f)i sin 9i , sin (jji sin 9i , cos 6i). 

For the product state, the probability 
P(a;, y|ai, a2. Si, S2) that we observe a pair of events 
x,y = ±1 under the (fixed) condition (ai, a2. Si, S2) is 
given by 

ui \ c G ^ 1 + a^ai ■ Si 1 + ya.2 ■ S2 , . 
P(a;,y|ai,a2,Si,S2) = , (25) 

and yields expectation values that are in agreement with 
the third column of Table HI Obviously, for the spin- 
polarized state p(ai, a2) = i?(ai, a2) — -Bi(ai)£'2(a2) — 0, 
hence there is no correlation in this case. 



C. Photon polarization 

In the quantum theoretical description of Case I, the 
whole system is described by the state 

1*) = m^\Vh \V)i\H)2) = {\HV) \VH)) , 

(26) 

where H and V denote the horizontal and vertical po- 
larization and the subscripts refer to photon 1 and 2, re- 
spectively. The state |^') cannot be written as a product 
of single-photon states, hence it is an entangled state. 

In Case II, the photons have a definite polarization rji 
and r]2 when they enter the observation station. The po- 
larization of the two photons is described by the product 
state 

I*) =(cos77i|H)i + sin ?7i|y)i)(cos 7721^)2 + sin ?72|1^)2)- 

(27) 

Using the fact that the two-dimensional vector space with 
basis vectors {\H), \ V)} is isomorphic to the vector space 
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TABLE II: Quantum system of two photon polarizations: The 
expectation values in the singlet state (Case I) and in the 
product state (Case II) where cos 9i = ai • Si , cos 62 =32-82, 
and cos 61,2 = Si ■ S2. 





Case I 


Case II 


£i(ai) 





cos 28i 







cos 202 


-E(ai, a2) 


— cos 26'i,2 


cos 26i cos 202 



of spin-1/2 particles, we may use the quantum theory of 
the latter to describe the EPRB experiments with pho- 
tons. The resulting expressions for the averages are given 
in Table |TT1 They are similar to those of the genuine 
S — 1/2 problem except for the restriction of ai and a2 
to lie in planes orthogonal to the direction of propagation 
of the photons and the factor of two that multiplies the 
angles. The latter reflects the fact that the polarization 
is defined modulo tt, not 27r as in the case of S* = 1/2. 



IV. RELATING QUANTUM THEORY AND 
DATA 

There is no doubt that quantum theory is very suc- 
cessful in describing a vast amount of phenomena in 
which we observe the ensemble average of many mea- 
surements that are repeated under the same external 
conditions 0, H. The EPRB experiments seem to be 
no exception: The analysis of the experimental data ac- 
cording to the procedure discussed earlier, demonstrates 
that £;(ai , aa ) «:! -B(ai , aa) [H, m, m, 113, H H [m m . 

On the other hand, as is well known from the early days 
of quantum mechanics, quantum theory itself has noth- 
ing to say about the individual events (quantum mea- 
surement paradox) 0, 0] ■ The very concept of an event 
cannot be reconciled with quantum theory 0, [3] . 

In this section, we elaborate on the relation between 
quantum theory and (experimental) data. 



A. From quantum theory to experimental data 

The fundamental problem of relating the object in the 
mathematical formalism of quantum theory to experi- 
mental facts may be solved by (1) interpreting the state 
of the system as the probability distribution for events 
to occur and by (2) supplementing quantum theory by 
a Bernouilli process [37, 38] that generates logically in- 
dependent events according to the prescribed probability 
distribution, the so-called measurement postulate. Thus, 
we have 

Quantum theory + Bernouilli process =J> Events. (28) 



All treatments of quantum theory that we are aware of 
turn the logical implication Eq. around, without any 
justification and declare all quantum events to be uncor- 
related random. Of course, it might be the case that 
the analysis of experimental data supports the hypothesis 
that the events are generated as Bernoulli trials. How- 
ever, there is rather compelling experimental evidence 
that successive events are correlated 63] . Notwithstand- 
ing this, using Eq. (|28|) we are in the position to use 
quantum theory and discuss events in a mathematically 
well-defined context. 

For simplicity, in the example of the EPRB experi- 
ment, we focus on the case where ai and a.2 are fixed 
in time. Let us then inquire how we can simulate the 
quantum theoretical results of the EPRB experiment (see 
Table I) using the procedure laid out by Eq. (|28p . 

According to the axioms of quantum theory, in each 
event we observe only one of the eigenvalues of the dy- 
namical variable that is being measured . For the case 
at hand, the eigenvalues of cti -ai, (T2 -aa, and ai -ai era -aa 
are ±1. Then, according to Eq. ([28|l . what is left to do 
is to imagine three Bernoulli processes that generate sets 
of data Q — {an = ±1, bn = ±1, c„ = ±l]n = 1, . . . , N} 
such that for a sufficiently large number of events N, 



— ^a„ « -Ei(ai), 

n=l 
1 ^ 

1 ^ 

-^c„ « i?(ai,a2), (29) 



for all ai and 3.2, the expressions for £'i(ai), i?2(a2) and 
i?(ai,a2) being given in Table H The fact that we use 
Bernouilli processes in which every trial is drawn from the 
same probability distribution guarantees, by the law of 
large numbers, that the average over all events converges 
with probability one to the ensemble average [13, ISa], 
which in the present case is given by quantum theory. 
Note that quantum theory does not impose any relation 
(correlation) between the numbers a„, 6„, and c„, other 
than that Eq. ^ should hold. 

In general, generating data {x, y) according to the 
probability distributions Eqs. (PO)) and is a nearly 
trivial exercise. Once we have solved the quantum me- 
chanical problem, that is, once we have the explicit form 
of the wave function, constructing a Bernoulli process 
that generates events according to the explicit form is a 
simple task. In practice, we assume that the pseudo- 
random number generator that we employ produces 
Bernoulli trials, a hypothesis that cannot be justified in 
a mathematically strict sense. 
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B. Fundamental problem 

Let us now try to relate the quantum theoretical ex- 
pectation values that appear in Eqs. PTI)1 and (jlip to the 
actual data. In general, the probability for observing a 
pair of dichotomic variables {x, y} can be written as 



A consistent application of the postulates of quantum 
theory yields 



P(a;,y|a,b) 



1 + xE^ (a, b) + yEy (a, b) + xyE^y (a, b) 



(30) 

from which, by the standard rules of probability theory, 
it follows that 

P,(x|a,b) ^ ^ P(^,j,|a,b) = i±^^l^,(31) 

y=±l 

P,(y|a,b) ^ ^ P(^,y|a,b) = i±^^|^.(32) 

x=±l 

By definition, x and y are logically independent if and 
only if P(x,2/|a,b) = Pj,(a;|a, b)P^(2/|a, b) @,[33,[3l. If 
X and y are logically independent it is easy to show that 
Exy = ExEy. In general, the converse is not necessarily 
true 0, [33, [sll but in this particular case it is. Indeed, 
if Exy = E^Ey, if follows directly from Eq. ([50]) that 
P(x,y\a,h) = Pj,(2:|a, b)Pa;(y|a, b). Thus, for the case 
we are treating here, E^y ^ E^Ey if and only if x and y 
are logically dependent. 

In quantum theory, we have two different cases also. 
If the density matrix of the two spin-1/2 particle quan- 
tum system factorizes (Case II), we have (cti • a (T2 • b) — 
(cti • a) ((72 • b) and the state of the system is completely 
characterized by £'i(a) and E2{h). However, if the den- 
sity matrix does not factorize (Case I), a complete charac- 
terization of this entangled state requires the knowledge 
of Ei{a.), E2{h), and i?(a, b). Upto this point, it seems 
that there is full analogy with the probabilistic model of 
the data, but we still have to relate the quantum theo- 
retical expressions to the observed data. 

To this end, we invoke the postulate that states that 
the possible values of a dynamical variable in quantum 
theory are the eigenvalues of the linear operator that cor- 
responds to this variable For the case at hand, the 
operators are cti -a, (T2 'b, and cti -a (T2 'b, with eigenvalues 
X = ±1, y = ±1 and z = ±1, respectively. 

It is evident that the triples {x, y, z} cannot repre- 
sent the data Eq. ^ that is recorded and analyzed in 
real EPRB experiments [H, [H, [13, [H, [s^, [Ml, [ll : The 
quantum mechanical model is trivially incomplete in that 
it has no means to describe the time-tag data. But, 
quantum theory is incomplete in a more fundamental 
sense [l,[2l]. 

First, let us consider an experiment that produces z 
only. In general, the probability to observe z can be 
written as 



P(£|a,b) = 



1 + zE{&, b) 



(34) 



and we would use i?(a, b) — Ez{sl, b) to relate the theo- 
retical result to the data. Likewise, we could imagine an 
experiment that produces x (y) and use Ei (a) = E^ (a, b) 
(i?2(b) = Ey{aL,h)) to relate the theoretical description 
to the data. 

Second, we ask whether it is possible to describe by 
quantum theory, an experiment that yields the data 
{x,y}. According to the postulates of quantum theory, 
the probabilities for the eigenvalues to take the values 
{x, y} are given by 



P{x\a,h) = 
P(y|a,b) = 



l + xEi{a.) 



l + yE2{h) 



(35) 
(36) 



where x, and y are logically independent random vari- 
ables, that is each measurement of a dynamical variable 
constitutes a Bernouilli trial [3]. Then, we would use 
Ei{a) = Erc{a,h) and E2{h) = Ey{a,h) to relate the 
theory to the data. 

But the real data is {x,?/}, not the logically indepen- 
dent random variables {x, y} of the mathematical model. 
Therefore the quantum theoretical description of an ex- 
periment that yields {x, y} is necessarily incomplete if 
the data is such that E^y 7^ ExEy. 

The fact that EPRB experiments show good agree- 
ment with the quantum theory of two 5=1/2 objects 
is not in conflict with this reasoning: In real EPRB ex- 
periments, the coincidences are computed according to 
Eq. ([2]), which includes the time-tag information, about 
which quantum theory has nothing to say. Hence there 
is no logical inconsistency. 



C. From experimental data to quantum theory 

Let us now turn things around and ask the much more 
interesting question how we, as observers, relate the ob- 
served data of an EPRB experiment to quantum theory. 
To simplify the discussion, we assume that the directions 
ai and a2 are fixed. Thus, we start from the data set 

£ = {Xn,l,Xn,2,tn,l,tn,2\n=l,. . . ,N}, (37) 

and ask the question how to relate these numbers to the 
set of data 



(38) 



P(z|a,b) 



1 + zE ^{a,h) 



(33) 



Q = {a„, 6„, Cn\n = 1, . . . , N}, 



that we obtained by adopting the procedure Eq. ((28)) . 

It is not difficult to see that there are no a-priori rules. 
How could there be rules? In general, there is no guar- 
antee that the data £ that resides on the hard disk of the 
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experimenter's computer has been produced by a physi- 
cal system and not by, for instance, a bug in the operating 
system that is controlhng the computer. Moreover, for 
bonafide experimental data, it should not matter who 
carries out the data analysis: Once the data has been 
recorded and there is agreement on the procedure to an- 
alyze this data, the results (but not necessarily the sub- 
jective conclusions) should not depend on whether or not 
the individual that performs the data analysis "knows" 
about quantum theory. The following example may be 
useful to understand the conceptual problem. 



1. Relating frequencies to probabilities 

Let us consider the experiment in which we toss a 
coin TV times. The set of N observations looks like 
{H, H,T, . . .} where H and T denote head and tails, re- 
spectively. From the set of data, we find that the number 
of times that the coin ends up with tails on the floor is 
h. Thus, the frequency with which we observe head is 
then / = h/N, which clearly is a well-defined number. 
Little thought shows that without any further knowl- 
edge/assumption about the experiment, that is all we 
can say (of course, we could calculate correlations be- 
tween events and so on but this does not change the 
essential point of the discussion). 

Imagining that we can continue the experiment for- 
ever does not help either because lim^^oo h/N is not 
well-defined [s^, [H, liO] . Indeed, it may happen that we 
never observe heads or always observe heads. If, in our 
description of the experiment, we would like to go be- 
yond just giving the numbers {hi,Ni) for « = 1, . . . ,M 
repetitions of the experiment, we have to make additional 
assumptions. 

Implicit in the interpretation of most scientific exper- 
iments is the assumption that there is some underlying 
process that generates the data. In the simple case of the 
coin, assuming Newton's law holds, solving the equations 
of motion allows us to predict the outcome of each indi- 
vidual toss [3^ . This outcome depends on how well we 
know the initial conditions, the precise form of the force 
field and so on. 

If a description on the level of individual events seems 
too complicated, or if we do not have enough knowledge 
to describe the whole experimental situation (as in the 
case of the coin), it is customary to postulate that there 
is some underlying probabilistic process that determines 
the frequency with which the events will be observed. 

It is instructive to see how the process of reasoning 
works in the case of the coin (the use of quantum the- 
ory to describe observed phenomena requires the same 
logic). As usual, the simplest probabilistic model for the 
outcome of the experiment of tossing the coin, assumes 
that (1) there is a probability p to observe heads and 
that (2) this probability is logically independent of what 
happens at other tosses. Now, these are nice words but, 
in the absence of any experimental data, what do they 



mean? 

The probability p is a mathematical concept that we 
use to encode, by a real number in the interval [0,1], 
our state of knowledge about the problem [s^, • The 
statement that this probability is logically independent 
of what happens at other tosses cannot be expressed in 
terms of frequencies [13, IH, . It is an hypothesis that 
we make without knowing what the frequencies and cor- 
relations between the events will be. Once we have col- 
lected the experimental data, we may compute the prob- 
ability for this hypothesis to be true or not and we may 
also use the observed frequency to assign a value to the 
probability p HI,!!!!!!. 

From a logical and conceptual point of view, it is ex- 
tremely important to realize that the first step is to define 
the concept of " probability" through the Kolmogorov set 
of axioms or through the more general inductive logic ap- 
proach (see also Section IVl)) [37]. Then, and only then, it 
may make sense to use the observed frequency to assign 
a number to the probability for an event to occur. We 
continue with the example of the coin to illustrate this 
point. 

Now imagine a thought experiment {— a mental con- 
struct) in which we toss the coin N times. Note that 
in a strict mathematical sense, the mathematical model 
cannot be simulated by an algorithm on a digital com- 
puter, which by construction is a deterministic machine. 
Of course, using pseudo-random numbers, we can sim- 
ulate events that are unpredictable to anyone who does 
not know the initial state or the pseudo-random number 
generator algorithm. The mathematical model can then 
be used to test whether it describes the global features 
(but not the individual events) well. 

A direct, constructive proof that probabilities are de- 
fined through frequencies would be to invent a practical 
procedure (algorithm) that simulates the tossing of the 
coin such that the probability for head is exactly p and 
such that each toss is logically independent from all oth- 
ers. Such an algorithm does not exist: The concept of 
probability is a mental construct and has no meaning in 
the realm of algorithms that generate events but, that 
does not imply that the concept of probability would be 
useless for describing some of the features of the data 
generated by these algorithms. 

In essence, we are repeating what has been said in the 
introduction. Looking back at the diagram in Fig. [U 
the mathematical model is located at the right hand side 
(model space). The mathematical model itself does not 
"produce" events. This is done by some algorithm (data 
space). We can test the various hypotheses that under- 
pin the mathematical model by calculating expectation 
values (ensemble averages in the case of the coin) and, 
using the mathematical machinery of probability theory, 
compute the probability that these hypotheses are cor- 
rect. Let us now see how this works in the case of the 
coin. 

According to the assumed mathematical model, the 
probability to observe k heads and N—k tails in a thought 
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experiment involving N tosses is given by [37|, [381, |4i 



Pik\N,Z) 



Nl 



kl{N-k) 



N-k 



(39) 



where Z represents all other knowledge about the ex- 
periment not contained in k and TV [s^, E^l- If m 
denotes the number of heads such that P{m\N, Z) = 
maxfe P{k\N, Z), we have 



P{m\N,Z) __ (m + 1) 1 -p 
P(to+ l|iV, Z) ~ N-m p 



and 



P{m\N, Z) 
P{m- 1\N, Z) 



N -m+l p 



from which it follows that 



m 



N+1 



< v< 



1-p 



m + 1 
N + 1' 



> 1, 



> 1, 



(40) 
(41) 

(42) 



and 



In 1 m , 1 , , , 



According to our mathematical model, of all k — 
0, . . . , m, the value of k that has the largest probabil- 
ity to occur is m and from Eq. it follows that as 
N increases, m/N p. Of course, we can easily calcu- 
late other useful quantities such as the ensemble average 
(k/N) = N-^J2^^QkP{k\N,Z) = p and the variance 
((fc/7V)2) - {k/Nf = 7V-ip(l - P)- 

We now consider the real experiment in which we toss 
the coin and assign the value a;„ = 0, 1, if at the nth toss, 
we observe tail or head, respectively. The frequency of 
heads is then / = J2n=i "^^^ next logical step 
is to assume that the mathematical model, described 
above, is valid. Then, for the most likely experiment 
(the one that occurs with the largest frequency) we have 
(1 + N-'^)p - N-'^ < / < (1 + N-'^)p. Furthermore, 
the ensemble average of each event Xn becomes a mean- 
ingful concept and, if we compute the ensemble average 
of the frequency, we find (/) = p a.s naively expected. 
Thus, it makes sense to use the observed frequency / for 
assigning a number to the symbol p in the mathemati- 
cal theory. At this point, the mathematical theory has 
been "connected" to the observed phenomena. Once this 
connection has been made, we can (and should) use the 
tools of probability theory to compute the probability 
that the assumptions of the mathematical model are cor- 
rect by confronting the mathematical results for various 
ensemble averages to the corresponding averages of the 
observed data. 

From this simple example, we see that in order to at- 
tach a meaning to the observed frequencies, we first need 
to introduce a mathematical model, probability theory in 
this case, and make the hypothesis that the outcome of 
the toss is determined by a Bernouilli process with prob- 
ability p. Only when this hypothesis has been made, it 
can be proven that the observed f req uency approaches p 
as iV ^ oo with probabihty one [lllli,!!^. Thus, the 



concept of probability and probability theory have to be 
introduced first. Only then we can use probability theory 
to relate the variables in the probabilistic model {p in the 
example of the coin) to the observed data. 

This simple example clearly shows that frequencies and 
probabilities have a different logical status [1, [13, lioj . 
Frequencies are the things that we observe (data space 
in Fig. [1]) and exhibit a causal dependence on the condi- 
tions under which the data is recorded. Probability the- 
ory is a well-defined mathematical model (model space 
in Fig. [T|that allows us to think in a rational, logical 
manner [37I |40| . Probabilities express logical relation- 
ships. A problem with the conceptual difference is that 
in many instances, simply using the frequency to assign 
a value to the probability works so well that we may be 
inclined to forget that there is a fundamental difference 
between the two. Although it is generally recognized that 
logical implication is not the same as physical causation, 
mixing up frequencies and probabilities leads to bizarre 
conclusions 0, M \M- As we discuss later, the mys- 
teries surrounding the EPR paradox and Bell's theorem 
dissolve if one recognizes that physical cause and logical 
dependence are fundamentally different concepts [37[. 



2. Relating EPRB data to quantum theory 

In the case of the EPRB experiment, we immediately 
see that we face the same fundamental problem if we go 
beyond the description of merely giving the data collected 
in the experiment. To make progress in understanding 
the behavior of the system as it is revealed to us by our 
(experimental) method of questioning, we have three op- 
tions: 



1. Use the established mathematical framework of 
probability theory to relate the quantities that ap- 
pear in this theory (probabilities) to experimentally 
observed facts (expected frequencies). 

2. Construct an event-based computer model that di- 
rectly generates the data set Eq. (157)) . with expec- 
tation values that agree with those of quantum the- 
ory 

3. Without relying on concepts of quantum theory, 
construct a probabilistic model that predicts the 
expected frequencies as observed in the experi- 
ment. 



As quantum theory has nothing to say about individual 
events 0, |3| , logically speaking option (2) cannot make 
any reference to quantum theory. Sections |V] and IVIIII 
are devoted to options (2) and (3), respectively. For now, 
we continue with option (1). 
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If we measure a property of a single particle, 
Eq. ([^5]) we naively expect that the assignment 



from 



^2 (as) 



xy 



12x.,y=±l ^xy 
'^x,y=±l y^xy 
'l2x,v=±l ^xy 



(44) 



holds with probability one. Note that Eq. ([44|) contains 
contributions from the events that fall within the coinci- 
dence window only. As explained earlier, for the assign- 
ments Eq. to make sense mathematically, we have to 
assume that there is an underlying probabilistic process 
that generates the data {xn^i\. The fact that quantum 
theory describes a very large variety of experimental data 
strongly suggests that the assignment Eq. makes a 
lot of sense. 

As explained earlier, for the quantum dynamical vari- 
able iTi • a ■ b, it is not clear at all how to relate its 
eigenvalue c„ to the data set Eq. ([57]) . What does it mean 
to measure a common property of a system of two par- 
ticles? Why is the time-tag data absent in the quantum 
theoretical description while it is of vital importance for 
the experiment? Evidently, we need a proper operational 
definition of "a system of two particles" in terms of the 
observed data. 

As it is our aim to reproduce the experimental results 
as well as the results of the quantum model for the exper- 
iment, it would be logically inconsistent to adopt a def- 
inition that is different from the one used in real EPRB 
experiments. Therefore, we should consider the assign- 
ment 



X^x.y ^yCxy 



(45) 



where the frequency to observe systems of two particles 
is given by 

/(ai,a2,iy) = ^E^-y 

x,y 
N 

= ^E'®(^-I*".i-*".2|). (46) 

n=l 

In Eq. (|46p . the coincidence in time enters because it 
is an essential ingredient in any EPRB experiment. The 
expression for the coincidence is an operational procedure 
to define precisely, in terms of the observed data, the 
meaning of the statement that two particles constitute a 
two-particle system. 



V. SIMULATION MODEL 

In this section, we take up the main challenge, the con- 
struction of locally causal (in Einstein's sense) processes 



that generate the data sets Eq. ([2]) such that they re- 
produce the results of quantum theory, summarized in 
Table I. 

A concrete simulation model of the EPRB experiment 
sketched in Fig. [2] requires a specification of the infor- 
mation carried by the particles, the algorithm that simu- 
lates the source and the observation stations, the Stern- 
Gerlach magnets, and the procedure to analyze the data. 
We now describe a computer simulation model that gen- 
erates the data {Ti, T2}, see Eq. From the specifi- 
cation of the algorithm, it will be clear that it complies 
with Einstein's criterion of local causality on the ontolog- 
ical level: Once the particles leave the source, an action 
at observation station 1 (2) can, in no way, have a causal 
effect on the outcome of the measurement at observation 
station 2 (1). 

In this section, we limit the discussion to systems of 
two 5=1/2 particles. The algorithm that simulates the 
EPRB experiments with photons, as well as the results of 
the simulations, are very similar to those presented here. 
A detailed account of the simulations for the photon sys- 
tem can be found elsewhere i34„ |35|] . 



A. Algorithm 

1. Source and particles 

As in the quantum theoretical treatment of the prob- 
lem, we will consider two different cases. In Case 

I, the source emits particles that carry a unit vector 
Sn/j = (— l)*"'"^(cosiy9„ sin6'„, sin(p„ sin(?„, cos6'„), repre- 
senting the magnetic moment (or spin) of the particles. 
The spin of a particle is completely characterized by cpn 
and cos 0„ , which we assume to be distributed uniformly 
over the interval [0, 27r[ and [-1,1], respectively. In Case 

II, the source emits particles that carry fixed unit vectors 



2. Observation station 

Prior to the data collection, we fix the number M 
of different directions of the Stern-Gerlach magnets. 
We use 4M pseudo-random numbers to fill the arrays 
(bi^i, bi^ji/) for z = 1,2 (in the photon experiments of 
Aspect et al. and Weihs aL, M = 2 [32]). 



3. Stern-Gerlach magnet 

The input-output relation of a Stern-Gerlach magnet 
is rather simple: For a fixed direction a^ of the field, the 
Stern-Gerlach magnet deflects a particle with magnetic 
moment S„_i in a direction that we label by Xn,i — ±1. As 
the particle travels through the Stern-Gerlach magnet, 
the magnetic moment of the particle changes from S„_i 

to Sj2_^ — XYi.i'^i- 
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According to the simple quantum mechanical model of 
the Stern-Gerlach experiment [2], for fixed S and fixed a,j, 
the probability to observe Xn,i = ±1 is (l±S-ai)/2. Thus, 
in this case, the simulation algorithm should generate the 
sequence Xn,i = ±1 such that 



1 



N 



N 



1™ ^I]^".' 

n=l 



= S • a, 



(47) 



with probability one. However, if the input consists of 
uniformly distributed S„^i, the sequence of output bits 
Xn,i = ±1 should satisfy 



1 



N 



lim — } Xn 



0, 



(48) 



with probability one, independent of the orientation a^ 
of the Stern-Gerlach magnet. We now consider two algo- 
rithms, a deterministic and a pseudo-random one, that 
simulate the operation of a Stern-Gerlach magnet. 

Deterministic model. Elsewhere, we have demon- 
strated that simple deterministic, local, causal and classi- 
cal processes that have a primitive form of learning capa- 
bility can be used to simulate quantum systems, not by 
solving a wave equation but directly through event-by- 
event simulation [1, 0, [ll|, [l3| • The events are generated 
such that their frequencies of occurrence agree with the 
probabilities of quantum theory. In this simulation ap- 
proach, the basic processing unit is called a deterministic 
learning machine (DLM) [1, i, [HI, [H [sl . 

A DLM is a device that exchanges information with 
the particles that pass through it. It learns by comparing 
the message carried by an event with predictions based 
on the knowledge acquired by the DLM during the pro- 
cessing of previous events. The DLM tries to do this in an 
efhcient manner, effectively by minimizing the difference 
of the data in the message and the DLM's internal repre- 
sentation of it B i, [H d ■ A DLM learns by processmg 
successive events but does not store the data contained 
in the individual events. 

Connecting the input of a DLM to the output of an- 
other DLM yields a locally connected network of DLMs. 
A DLM within the network locally processes the data 
contained in an event and responds by sending a message 
that may be used as input for another DLM. Networks of 
DLMs process messages in a sequential manner and only 
communicate with each other by message passing: They 
satisfy Einstein's criterion of local causality. 

For the present purpose, we only need the simplest ver- 
sion of the DLM The DLM that we use to simulate 
the operation of the Stern-Gerlach magnet is defined as 
follows. The internal state of the zth DLM, after the nth 
event, is described by one real variable Un^i- Although 
irrelevant for what follows, this variable may be thought 
of as describing the fluctuations of the applied field due 
to the passage of an uncharged particle that carries a 
magnetic moment. As the particle with spin S„.i com- 
municates (interacts) with the DLM (applied field), the 



latter updates its internal state according to 



+ 1 — ^ if Sn,i ■ a.i > lUn^l^i (AQ'] 
lUn-l,i-l + l if SnA ■ SLi < lUn-l,i ' ' 



and the spin changes according to 



+aj if S„ i 
a^ if S^^^ 



> lUn-l,i 
< lUn-l,i 



(50) 



corresponding to spin up and spin down (relative to the 
direction of the magnetic field a^), respectively. If the 
DLM selects spin up (down), it generates a Xn,i = +1 
{xn,i = -1) event. In Eqs. (l49|) and ^U^, < ^ < 1 is a 
parameter that controls the speed with which the DLM 
learns (and forgets) about the incoming events. 

The dynamic behavior of the DLM, defined by the rule 
Eq. (|49)) is discussed in detail elsewhere [ll| and may be 
summarized as follows: 

1. If the DLM receives particles with fixed spin Sn.i — 
S, the sequence {xn,i} is periodic for all n > ng, 
no depending on uq^ and I 11]. For n > no, the 
frequency N±/{N+ + 7V_) of Xn,i = ±1 events is 
given by (1 + S • ai)/2 and we have [ll| 



(51) 



1 ^ 
lim — > Xjj 



exactly. Note that the limit iV ^ cx) in Eq. ([5T|) is 
well-defined because the sequence \xn^i\ is periodic 
with a finite periodicity |ll| . 

2. If the DLM receives S„_i, statistically independent 
and uniformly distributed over the unit sphere, 
then the DLM generates the sequence Xn^ = 
sign(Sjij^- a^) for all n > no, no depending on Mo,i 
and I [ll] . In this case we have 



1 ^ 
lim — a;„ 

n=l 



0. 



(52) 



In this case, the Xn^ are Bernoulli variables and the 
law of large numbers then guarantees that Eq. ([5^ 
holds with probability one 38] . 

Thus, depending on the nature of the input sequence 
S„_i, the DLM generates output sequences {xn,i = ±1} 
and particles with spin Sn,i such that the time average 
of these sequences agree with the experimental facts. 

Pseudo-random modeL The simplest algorithm 
that performs the task of simulating a Stern-Gerlach 
magnet reads 



+ 1 if r„ < Sr, 
-1 if r„ > Sr, 



(53) 



where — 1 < r„ < 1 are uniform pseudo-random numbers 
and the spin changes according to 



Sn,2 — 



-a,: if Xji^ — +1 

Hi if Xji_i — 1 



(54) 
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It is easy to check that on average, the input-output 
behavior is the same as the one of the ideaUzed Stern- 
Gerlach magnet. 



4-. Time tags 

When a charge-neutral, magnetic particle passes 
through a Stern-Gerlach magnet it experiences a time- 
delay that depends on the direction of its magnetic mo- 
ment relative to the direction of the field in the Stern- 
Gerlach magnet. Experimentally, this time-delay is used 
to perform spectroscopy of atomic size magnetic clus- 
ters [5TI and atomic interferometry (s^ . As a simple 
simulation model for this time delay mechanism, we as- 
sume that the time delay tn,i of a particle with spin Sn,i 
is distributed uniformly over the interval [to,to + Tn^i]. 
Similarly, experimental evidence that the time-of-flight 
of single photons passing through an electro-optic mod- 
ulator fluctuates considerably can be found in Ref. [5^. 
The idea that these fluctuations might be responsible for 
the observed "quantum correlations" has been proposed 
in our earlier work [2l|. 

From Eq. ((S]), it follows that only differences of time 
delays matter. Hence, we may put to — 0. The time-tag 
for the event n is then tn.i S [0,T„.i]. We thus need an 
explicit expression for Tn.i- The choice Tn^i = constant is 
too simple: In this case we recover the model considered 
by Bell, which is known not to reproduce the correct 
quantum correlation Eq. [TJ] . 

Assuming that the particle only "knows" the direction 
of its own spin relative to the direction of the magnetic 
field in the Stern-Gerlach magnet, we can construct one 
number that is rotationally invariant, namely Sn,i • ai. 
Thus, we assume T„.i — _F(S„.i • a.^). As Sn,i ■ = 
cos 0s„ ia.i determines whether the particle generates a 
-1-1 or —1 signal, it is not unreasonable to expect that F 
is a function of sin0s„ ia;- After a few trials, we found 
that T„^, = Toll - (S„,i • aO'l'^/^ = y^|s„_^ x a,]"*, yields 
interesting results. Here, Tq is the maximum time delay 
which defines the unit of time and d is a free parameter 
in our model. In the sequel, we express r, W, tn^i and 
T„ i in units of Tq, which for convenience we set equal to 



5. Data analysis 

The algorithm described earlier generates the data sets 
Ti for spin-1/2 particles, just as experiment does for pho- 
tons [32j. In order to count the coincidences, we strictly 
follow the procedure adopted in the EPRB experiment 
with photons [13]. First, we choose a time-tag resolu- 
tion < T < To and a coincidence window t < W. We 
set the correlation counts Cxy {am , Pm' ) to zero for all 



data sets. Here \x~\ denotes the smallest integer that is 
larger or equal to x, that is [a;] — 1 < a: < \x] . Accord- 
ing to the procedure adopted in the experiment [12], an 
entangled pair is observed if and only if ]A:„,i — kn,2\ < 
k — \W/t~\. Thus, if ]fc„_i — kn^2\ < k, we increment the 
count Cx^ i.x„ 2i'^rn, Pm')- After processing all the data 
for the N events, we compute the single-particle expecta- 
tion values and the correlation according to Eq. ([¥]) and 
Eq. ([6]), respectively. 



B. Deterministic model: Results 

1. Simulation of Case I and II 

We first demonstrate that the simulation model repro- 
duces the results of quantum theory in the case of the 
EPRB experiment (Case I). In Fig. [6] we show simula- 
tion data for fc = 1, d = 0, 3, r 0.001, I = 0.999, and 
M — 10, N ^ 10^ for 100 randomly chosen values of 
ai • 3.2, covering the interval J— 1, +1]. At the nth event, 
two uniform pseudo-random numbers 1 < m,m' < M are 
used to select the rotation angles a„^i = b^.m. Within the 
statistical errors, for the pseudo-random number gener- 
ators that we use (65j . the correlation between m and 
to' is zero. The solid line is the prediction of quantum 
theory, see second column of Table HI It is clear that for 
d = 3 there is excellent agreement between simulation 
and quantum theory. This is not an accident. Simula- 
tions for d = 3 but with different values of the other pa- 
rameters (results not shown) confirm that for sufficiently 
small T and sufficiently large N, the simulation model 
reproduces the quantum theoretical results listed in the 
second column of Table [J 

Second, to simulate Case II, we let the source produce 
particles with fixed polarization but we do not change 
the algorithm that simulates the observation stations. In 
Fig. [71 we present simulation data for A: = 1, d = 0,3, 
T = 0.001, I = 0.999, and TV = 10^ ai = (0, 0, 1), aa = 
(1/2,1/2,1/V2), and S„,, = (-l)'+i(sin77, 0, cos 77) for 
< 77 < TT. For this choice of ai, a2 and Sn,i, quantum 
theory predicts (see Table I) 



E2{^2) 

E{aii,a.2) 



= cos?7, 

sin r] + \/2 cos ry 
" 2 ' 

(sin ?7 + \/2 cos 77) cos 7 



(55) 



x,y = ±1 and m,m' 
cretized time tags kn^-, 



l,...,Af. We compute the dis- 
= \tn,i/T~\ for all events in both 



and for d = 3, as shown in Fig. [71 the simulation model 
reproduces the quantum theoretical results very well. 

Extensive tests (data not shown) lead to the conclusion 
that for d = 3 and to first order in W, our simulation 
model reproduces the results of quantum theory of two 
5=1/2 objects, for both Case I and Case II. 

Also shown in the left panel of Figs. [5] and [7] are the re- 
sults for ii^(ai, a2) if we ignore the time-delay data (equiv- 
alent to d = or W > To). In Case I (see Fig. we 
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FIG. 6: Comparison between the event-based simulation results obtained by using a deterministic model for the Stern-Gerlach 
magnets and quantum theory for Case I. Left: The two-particle correlation i?(ai,a2) as a function of ^aia2 = arccos(ai • 3.2). 
The simulation results are for A: = 1, r = 0.001, I = 0.999, M = 10, iV = lO'', d = 3 (red bullets) and d = (blue squares), 
the latter corresponding to discarding the time-tag data (equivalent to > To). Solid line (black): _E(ai,a2) — — cos^aia2, 
as obtained from quantum theory. Right: Single-particle expectation value as a function of 6i = arccos(ai • z), where z is the 
unit vector in the z-direction. The simulation results are for = 1, r = 0.001, / = 0.999, A4 = 10, N = 10®, and d = 3. Bullets 
(red): iJi(ai,a2); Squares (blue): _B2(ai,a2). Solid line (black): £'i(ai) = £'2(a2) = for all orientations of the Stern-Gerlach 
magnets, as obtained from quantum theory. 
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FIG. 7: Same as in Fig. |6] except that we simulate Case II with ai — (0,0,1), a2 — (1/2, 1/2, 1/a/2), S„,i = 
(—1)'^^ (sin?;, 0, cos 77) for < 77 < vr, and that we plot the two-particle correlation and the single-particle expectation value as 
a function of rj. 



obtain simulation results that agree very well with the 
result that is obtained by considering the class of mod- 
els studied by Bell. In Case II, i?(ai,a2) is given by the 
expression in Eq. (|55p and up to the usual statistical fluc- 
tuations, the simulation data (see Fig. ^ do not depend 
on the value of the time-tag parameter d and the time 
window W. 



2. Case I: Numerical treatment 

As a check on the simulation results for Case I, we ex- 
amine the limit — > 00 and show that to first order in 
W, the simulation model yields the two-particle correla- 
tion that is characteristic for the singlet state [21, 33]. 

In the case of Case I we may replace the DLM model 
for the Stern-Gerlach magnet by the more simple model 
that generates data according to Xn,i = sign(S„.i • a^). 
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FIG. 8: Graphical representation of the process of counting 
pairs. The time interval is divided in bins of size r, repre- 
sented by the elementary squares. The two parallel, 45° lines 
indicate the time window W , which was chosen to be 2r in 
this example. In the limit N ^ oo, the total number of pairs 
for fixed a; and {(fi, 9) is given by the number of whole squares 
that fall within the time window and satisfy 1 < ki < Ki for 
i = 1,2. For Ki > K2, all filled squares contribute while 
for K'l = K2, the dark gray square does not contribute. For 
Ki < K2 we interchange labels 1 and 2. 



For N ^ 00, Eq. ^ can be written as 



E{aii,a2) 



J^"" xiX2D{Ti,T2, W) sin 9d9dip 



(56) 



where D{Ti,T2,W) is the density of coincidences for 
fixed SLi and angles (93, 9) (within a smaU surface area 
smOdOdip), Ti = F{Si ■ a^), — Si{ip,6) and Xi — 
sign(Sj • ai). 

An analytical expression for D{Ti,T2,W) can be de- 
rived as follows. For a fixed time-tag resolution < 
T < 1, the discretized time-tag for the nth detection 
event is given by kn^i = \tn,iT~^~\ where [a;] denotes the 
smallest integer that is larger or equal to x. The dis- 
cretized time-tag takes integer values between 1 and 
Ki = \T~^Ti\, where Ki is the maximum, discretized 
time delay for a particle carrying angles (ip, 9) and pass- 
ing through a Stern-Gerlach magnet with orientation a.; . 
If \kn,i — kn,2\ < k = \t~^W~\, the two spin-1/2 parti- 
cles are defined to form a pair. For fixed a^ and (</3, 6*), 
we can count the total number of pairs, or coincidences 
C{Ki , K2 , fc) , by considering the graphical representa- 
tion shown in Fig. [51 After a careful examination of all 
possibilities, we find that the density can be written as 
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FIG. 9: Results for S{e) = S{&, b^^c, d) where a • c = b ■ c = 
b ■ d = cos 6 and a • d = cos 0] , for the case k — W = 1 
and d — 0, . . . , 5, as obtained by numerical evaluation of 
Eq. JSni). Solid squares (gray): d ^ {or W > To); Plusses 
(blue): d = 1; Crosses (green): d = 2; Bullets (red): d = 3; 
Open squares (magenta): d — 4; Open diamonds (blue): 
d = 5. Dashed horizontal lines at +2 (—2): Maximum 
(minimum) value if the system is described by a factorizable 
two-particle probability distribution. Solid horizontal lines at 
+2\/^ (— 2\/2): Maximum (minimum) value if the system is 
described by the quantum theory for two spin-1/2 particles. 
Solid line: S{9) = cos 3^ — 3 cos 0, as obtained from quantum 
theory. 



D(Ti,T2,r) = C{Ki,K2A)/KiK2 where 
CiKi,K2,k) 



= (2fco-l)fci2-fco(fco-l)/2 

- max(0, {K12 - 1) max(0, ifi2)/2) 
-I- max(0, k — A;o)A;o 

- max{0,kki2- K1K2), (57) 



and fco = min(_R"i, fc), ki2 = min(_R"i, ^2), and 
K12 = fci2 — max(0, max(_K'i, ^2) — fc). After substituting 
Eq. ([57|l into Eq. ([56]) . the remaining integrals are easily 
calculated numerically. 

In Fig. [9] we present results for S{9) = S{a, b, c, d) for 
the case k = W — 1 and d — 0, . . . , 5 and the choice 
a c — b e = b d = cos 61 and a • d = cos 30 0]. 
For d = (or W > Tq), we find that S{9) < 2. Thus, 
we see that ignoring time-tag data automatically renders 
our model incapable of producing data that violates the 
Bell inequalities [ill. For 1 < d < 3, 2 < Smax < 2\/2 
and hence, the model violates the Bell inequality but 
does not reproduce the correlations of the singlet state. 
As expected on the basis of our results for i?(ai,a2), 
if c? = 3, the numerical results produced by our model 
are indistinguishable from the quantum theoretical re- 
sult S{9) = cos 36* - 3 cos 6*. For d>3, 2^2 < S„^ax < 4, 
implying that our model exhibits correlations that can- 
not be described by the quantum theory of two spin-1/2 
particles, even though it rigorously satisfies Einstein's cri- 
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FIG. 10: Maximum of ^(a, b, c, d) as a function of the time 
window W relative to the time-tag resolution r for a ■ c = 
b ■ c = b ■ d = cos 8 and a • d = cos 36^. Curves from bottom 
to top: Results obtained from Eq. (|56p for d = 0, 1, . . . , 10. 
Dashed line: Value of Smax ~ 2\/2 if the system is described 
by quantum theory. 



teria for local causality. 

It is clear that the result for the coincidences de- 
pends on the time-tag resolution r, the time window 
W and the number of events iV, just as in real exper- 
iments [23, [13, m 113, H H [31I, see Section HTCl 
Expression Eq. allows us to easily study the behav- 
ior of the model as a function of the time window W, 
relative to the time-tag resolution r. In Fig. \W\ we plot 
Smax as a function of W/ r for various values of d. Note 
that the numerical results agree with the values of Smax 
that can be obtained analytically for the limiting cases 
= r 0, d = 0, 3 and > To (see Sec. IVBl . From 
Fig. [TUl it is clear that for d = 3 and W = 0, the model 
reproduces the result of the quantum system in the fully 
entangled state. Furthermore, Fig. [TOl shows that, for suf- 
ficiently small time-tag resolution t, increasing the time 
window changes the nature of the two-particle correla- 
tions. Since is a parameter solely used in the data 
analysis procedure and Smax is a decreasing function of 
W, the value of Smax and/or of the correlations are not 
sufficient to make a definite statement about the nature 
of the source or even the nature of the complete set-up. 



3. Case I: Exact solution 

For some choices of the parameters, Eq. (15^ can be ex- 
pressed in closed form. We first examine the case W > Tq 
for which e(VF- |t„,i-t„,2|) = 1 and D{Ti,T2,W) = 1. 
Without loss of generality, we may choose the coordinate 
system such that ai = (1,0,0) and 3.2 = (cos a, sin a, 0). 
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FIG. 11: Comparison between the event-based simulation 
results obtained by using a deterministic model for the Stern- 
Gerlach magnets, quantum theory and the exact solution for 
the analytical model in the limit N ~* 00. The two-particle 
correlation i5(ai,a2) for Case I is shown as a function of 
OcLisL2 = arccos(ai • a2). Markers: Event-based simulation 
results obtained by using a deterministic model for the Stern- 
Gerlach magnets. The simulation parameters are fe = 1, 
T = 0.001, / = 0.999, M = 10, AT = 10^ d = 5 (red bullets) 
and d — (blue squares), the latter corresponding to dis- 
carding the time-tag data (equivalent to W > To). Solid line 
(black): Quantum theory i5(ai,a2) — — cos&aia2. Dashed 
line (black): Rigorous result Eq. (|61|l for the simulation model 
for d = 5 in the limit W = t ^ 0. 



Then, Eq. ^ reduces to [U 



1 /"Stt 

i?(ai,a2) — / / xiX2Sm9d9dip 

27r Jo Jo 

2 1 arccos(ai • a2)| 



(58) 



Obviously, Eq. (|58p does not agree with the quantum 
theoretical expression Eq. p^ . 

Second, we consider the case in which W ^ t. For- 
mula Eq. ([Ff|) greatly simplifies if we consider the case 
k = 1 {W = t), yielding C{Ki,K2,l) = mm{Ki,K2) 
as is evident by looking at Fig. [51 For W = t 
and fixed a^ and (v, ^), the density D{Ti,T2,t) = 
C{Ki, K2,l)/ K1K2 that we register two particles with 
a time-tag difference less than r is bounded by 



min(Ti + T,Ty + t) „ „ ^ min(ri , Ty) 



{Ti+t){T2+t) 



T1T2 



(59) 

For W — T ^ and = \Si x a.ip, the integrals in 
Eq. can be evaluated in closed form. Denoting yi = 
sign(cos(/5) and 1/2 = sign(cos((/5 — a)) and using the same 
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FIG. 12: Comparison between the event-based simulation 
results obtained by using a pseudo-random model for the 
Stern-Gerlach magnets, quantum theory and the exact so- 
lution for the analytical model in the limit N ^ oo. The 
two-particle correlation i5(ai,a2) for Case I is shown as a 
function of &aia2 = arccos(ai • a2). Markers: Event-based 
simulation results obtained by using a pseudo-random model 
for the Stern-Gerlach magnets. The simulation parameters 
are fc = 1, r = 0.00001, M = 10, = 10^ d = 7 (red bul- 
lets) and d = (blue squares), the latter corresponding to 
discarding the time-tag data (equivalent to W > To). Solid 
hue (black): Quantum theory i5(ai,a2) = — cos^aiaa- 
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FIG. 13: Same as Fig. 1 121 except that bullets (red) are simu- 
lation results for d = 5. 



coordinate systems as above, we find 



£■(31, aa) = - 



2ir 



inin(sin~ 03. sin (w — a)) 

V^ Vi ^-T — .' , " 

cji-a-^ sm if sm [if ~ a ) 



dip 



lo 



27r min(sin^ cp,sin^ (i/? — a)) 



' ip sin-^ (c/p — q) 



dip 



= ai • a2. 



(60) 



which is exactly the same as the quantum theoretical re- 
sult Eq. (I19p . In retrospect, it is remarkable that we ob- 
tain Eq. (I60p by requiring that the results do not depend 



on W and r, which in this case is very much the same 
as hypothesis (3) of Sec. IVIIIi used in the probabihstic 
modeling of the EPRB experiment. 

For other integer values of d, the integrals can be 
worked out as well but the calculations are rather te- 
dious and the results are not very illuminating. As an 
example, we give the expression for c? = 5: 



£^(ai,a2) 



ai • a.2 



15 - 7(ai • a2) 
11 - 3(ai • as) 



(61) 



In Fig. Ill[ we demonstrate that the simulation data for 
d = 5 agree very well with the analytical result Eq. (|6ip . 
As shown in Fig. [9l for c? = 5, the data not only violate 
the Bell inequality but also violate the rigorous upper- 
bound Smax < 2\/2 for a quantum system of two S* = 1/2 
particles. 



C. Pseudo-random model: Results 

Using the simple pseudo-random model for the Stern- 
Gerlach magnet yields results that are qualitatively the 
same as those of the deterministic model. Therefore, 
we present a few, representative simulation results only. 
A detailed analytical treatment of the pseudo-random 
model is given in Section [VI 1 1 and fully supports the sim- 
ulation results described next. 

In Fig. [13 we demonstrate that the simulation results 
for d — 7 are in excellent agreement with the quantum 
theoretical expression for the correlation in the singlet 
state. However, as we prove in Section [VIIl if the number 
of events goes to infinity, there is no exact agreement: 
There is a difference between the two curves of maximum 
2%. Note that in the case of the deterministic model 
exact agreement is obtained for d = 3. Also notice that 
there is some weak but systematic deviation from the 



exact results for 



and 9a 



TT. This is due to 



the pseudo-random nature of the model: It reproduces 



the perfect (anti) correlation at 9^ 



0, TT in the limit 



N ^ IX only, as shown rigorously in Section IVIII 

The results for d = 5 and d = 9, presented in Figs. [T3I 
and I14i respectively, show the same trend as we observed 
when using the deterministic model for the Stern-Gerlach 
model: For d — 5 the correlation is less strong than for 
a quantum system in the singlet state but for d > 8 it 
is definitely stronger. Notice that for a fixed number 
of events N, the systematic deviation from the perfect 
(anti) correlation at 9^^3.2 = 0, tt increases with d. 



D. Summary 

Starting from the factual observation that experimen- 
tal realizations of the EPRB experiment produce the data 
{Ti,T2} (see Eq. ([2|)) and that coincidence in time is a 
key ingredient for the data analysis, we have constructed 
computer simulation models that satisfy Einstein's con- 
ditions of local causality and, in the case that we employ 
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FIG. 14: Same as Fig. [12] except that bullets (red) are simu- 
lation results for d = 9. 



a deterministic model for the Stern-Gerlach magnet, ex- 
actly reproduce the correlation ii^(ai, = — ai • a2 that 
is characteristic for a quantum system in the singlet state. 
In this case, both the simulation and a rigorous mathe- 
matical treatment of the model lead to the conclusion 
that for d = 3 and W ^ t ^ 0, the model reproduces 
the results (see Table I) of quantum theory for a system of 
two S* = 1/2 particles. The pseudo-random model for the 
Stern-Gerlach magnet yields data that are qualitatively 
similar but, for integer values of d, do not exactly agree 
with quantum theory (see Section IVIip . It is of interest 
to mention here that if we simulate EPRB experiments 
that use the photon polarization as a two-state system, 
both the deterministic and pseudo-random model exactly 
reproduce the quantum theoretical results [ll], [s^ . 

Salient features of these models are that they generate 
the data set Eq. ^ event-by-event, use integer arith- 
metic and elementary mathematics to analyze the data, 
do not rely on concepts of probability and quantum the- 
ory, and provide a simple, rational and realistic picture of 
the mechanism that yields correlations such as Eq. 

One may wonder why particles emitted by a source 
with definite spin orientations that are exactly opposite 
to each other are not described by a density matrix that 
is a product state. Of course, in this respect the descrip- 
tion of our model may be deceptive. In a naive picture 
one might think that the whole system is described by 
a density matrix that is a product state. The problem 
with this naive picture is that it often works extremely 
well but in some cases leads to all kinds of logical in- 
consistencies (see Ref. [1] for an extensive discussion of 
this point) and it should not come as a surprise that the 
EPR problem is the prime example where the naive pic- 
ture breaks down completely. Quantum theory describes 
the system as a whole: It does not describe a single pair 
of particles as they leave the source. 

Another deceptive point may be that in our model, one 
can compute the correlation of the particles right after 



they left the source. This correlation is exactly minus 
one. However, this correlation has no relevance to the 
experiment: To measure the correlation of the particles, 
it is necessary to put in the Stern-Gerlach magnets, de- 
tectors, timing logic and so on. We emphasize that the 
simulation procedure counts all events that, according to 
the same criterion as the one employed in experiment, 
correspond to the detection of two-particle systems. 

Our simulation results also suggest that we may have 
to reconsider the commonly accepted point of view that 
the more certain we are about a measurement, the more 
"classical" the system is. Indeed, according to experi- 
ments and in concert with the prediction of our model, 
this point of view is in conflict with the observation that 
the more we reduce this uncertainty by letting W —^ 0, 
the better the agreement with quantum theory becomes. 
Both in experiments and in our model, the uncertainty 
is in the time-tag data and it is this uncertainty that 
affects the coincidences and yields the quantum correla- 
tions of the singlet state (if W ^ 0). Isn't it then very 
remarkable that the agreement between experiment and 
quantum theory improves by reducing (not increasing!) 
the uncertainty by making W as small as technically fea- 
sible? 

We have shown that whether or not these simulation 
models produce quantum correlations depends on the 
data analysis procedure that is performed (long) after 
the data has been collected: In order to observe the cor- 
relations of the singlet state, the resolution r of the de- 
vices that generate the time-tags and the time window 
W should be made as small as possible. Disregarding 
the time-tag data {d = or W > Tq) yields results that 
disagree with quantum theory but agree with the models 
considered by Bell [l3| . Our results show that increasing 
the time window changes the nature of the two-particle 
correlations. This prediction can easily be tested and 
is confirmed by re-analyzing available experimental data 
with different values of the time window W, as we did in 
Section EGI 

In Case I, the two-particle correlation depends on the 
value of the time window W . By reducing W from infin- 
ity to zero, this correlation changes from typical Bell-like 
to singlet-like, without changing the procedure by which 
the particles are emitted by the source. Thus, the char- 
acter of the correlation not only depends on the whole 
experimental setup but also on the way the data analysis 
is carried out. Hence, from the two-particle correlation 
itself, one cannot make any definite statement about the 
character of the source. Thus, the two-particle correla- 
tion is a property of the whole system (which is what 
quantum theory describes), not a property of the source 
itself. 

In contrast, in Case II, the observation stations al- 
ways receive particles with the same spin orientation and 
although the number of coincidences decreases with W 
(and the statistical fluctuations increase), the functional 
form of the correlation does not depend on W: In Case 
II, the single-particle and two-particle correlations do not 
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depend on the value of the time window W. 



VI. EINSTEIN'S LOCALITY VERSUS BELL'S 
LOCALITY 

Starting from the data gathering and analysis proce- 
dures used in EPRB (gedanken) experiments, we have 
constructed an algorithm in which every essential clement 
in the experiment has a counterpart (see Section[Tl|. The 
algorithm generates the same type of data as recorded in 
the experiments. The data is analyzed according to the 
experimental procedure to count coincidences. The algo- 
rithm satisfies Einstein's criteria of local causality, does 
not rely on any concept of quantum theory but neverthe- 
less reproduces the two-particle correlation of the singlet 
state and all other properties of a quantum system con- 
sisting of two = 1/2 particles. 

At first sight, our results may seem to be in contra- 
diction with the folklore on the EPR paradox, very often 
formulated in terms of Bell's theorem which states that 
quantum theory cannot be described by a local hidden 
variable model. In fact, there is no contradiction once 
one recognizes that the concept of locality, as defined by 
Bell, is different from Einstein's definition of locality. Bell 
made an attempt to incorporate Einstein's concept of lo- 
cality (defined on the level of individual events) to proba- 
bilistic theories, apparently without realizing that proba- 
bilities express logical, not necessarily physical, relation- 
ships between events. However, the assumption that the 
absence of a causal influence implies logical independence 
leads to absurd conclusions, even for very mundane prob- 
lems [36ll40j and it is therefore not surprising that, when 
applied to the quantum problems, this assumption can 
generate all kind of paradoxes [s^ . 

The simulation model that we describe in this pa per , 
and similar models that we described elsewhere [8l.l9l.llll. 
[Tsl . [66] do not rely on concepts of probability theory: 
They operate on the ontological, event-by-event level. 
Therefore it would be logically inconsistent to even at- 
tempt to apply Bell's notion of locality to these models. 
However, the fact that we have proven that there exist 
event-based models that satisfy Einstein's criterion of lo- 
cality and causality and also reproduce all properties of 
a quantum system consisting of two S = 1/2 particles, 
suggests that it may be of interest to revisit the relation 
between locality a la Einstein and locality a la Bell. 

Before we address this issue, we want to make clear 
that we do not question the validity of the Bell-type in- 
equalities. These inequalities are mathematical identities 
that are useful to characterize the amount of (quantum) 
correlation between two quantities. In this section, we 
focus on the logic that is used to address the meaning 
of "locality" in quantum physics. In the discussion that 
follows, we assume that all processes are causal, that is 
they should be physically realizable, and we implicitly 
exclude all others. 



A. Einstein's locality criterion 

Einstein expressed the principle of locality as the real 
factual situation of the system S2 is independent of what 
is done with the system Si, which is spatially separated 
from the former [2] . We formalize this by introducing 

Definition: A theory is E-local if and only if it satisfies 
Einstein's principle of locality for each individual event. 

Clearly, E-locality applies to each individual fact (onto- 
logical level). Recall that quantum theory or probability 
theory have nothing to say about individual events: They 
describe phenomena on the epistemological level. 

The simulation model that we describe in this paper 
is a purely ontological model of the EPRB experiment 
that can reproduce the results of quantum theory. From 
the description of the simulation algorithm, it is evident 
that Xn,i and t„^i depend on the variables (ifm^^n) that 
represent the magnetic moment of a particle, and on the 
orientation a„^i of the Stern-Gerlach magnets, which can 
be chosen at will for each {n,i). Furthermore, the event 
n cannot affect the data recorded for all n' < n, implying 
that the algorithm simulates a causal process. In addi- 
tion, it is obvious from the specification of the algorithm 
that Xn,i, tn,i, or a„_i do not depend (in any mathemat- 
ical sense) on a„^2 nor do Xn,2, t,n,2^ or a„^2 depend on 
a„^i. This implies that for each event, the numbers Xn,i 
and tns {xn,2 and tn.2) do not depend on whatever ac- 
tion is taken at observation station 2 (1). Summarizing: 
Our simulation model is E-local and causal. 



B. Bell's locality criterion 

To set the stage, we first recall the axioms of prob- 
abihty theory HI- Let A, B, and Z denote 

some propositions (events) that may be true (may oc- 
cur) or false (may not occur). The probability that 
A is true, conditional on Z being true, is denoted by 
P{A\Z) [13, Hi]. The axioms of probability theory may 
be formulated as d, [s^ 

1. < P{A\Z) < 1. 

2. P{A\Z) + P{A\Z) = 1, ^ denoting the logical nega- 
tion of A. 

3. P{AB\Z)^P{A\BZ)P{B\Z)^P{B\AZ)P{A\Z). 

These three axioms are necessary and sufficient to de- 
fine a consistent mathematical framework for probability 
theory. 

By definition, two events A and B are logically inde- 
pendent if and only if P{A\BZ) = P{A\Z) [13, Is^. If 
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the events A and B are logically dependent, we have 
P{A\BZ) P{B\AZ) 



P{A\Z) 



P{B\Z) 

P{B\AZ)P{A\Z) 
P{B\Z)P{A\Z) 
P{AB\Z) 



P{A\Z)P{B\Z) 



(62) 



showing that the assignment of the probability of the 
event A (B) depends on the knowledge of the event B 
{A). From Eq. ((62]), we see that P{A\BZ) ^ P{A\Z) 
unless the events A and B are logically independent (we 
may assume P{A\Z) > and P{B\Z) > because of the 
fact that we actually registered A and B). As we shall 
see shortly, the definition of logical independence is of 
extreme importance for understanding the implications 
of Bell's definition of locality. 

Bell considers theories (see Ref. tM Chapt.7) that 
assign a probability for an event A to be registered, 
given that the circumstances under which A is regis- 
tered are described by another event Z. The events 
A and Z are propositions of the kind "the values of 
the variables (as recorded by m measurement devices) 
are A = {Ai, . . . ,Ajn}" and "the values of the vari- 
ables (as recorded by n measurement devices) are Z — 
{Zi, . . . , Zn}" ■ Bell considers the case that the events A 
and B are localized in regions 1 and 2 respectively, and 
assumes that region 1 and 2 are separated in a space- 
like way such that events in region 1 (2) have no causal 
influence on events in region 2 (1) p^ . 

According to probability theory, we have [13, [38| 



P{AB\abz) = P{A\Babz)P{B\ahz), 



(63) 



where we introduced the notation X and Y to indicate 
that event X (Y) can have no causal effect on event Y 
[X). We also made explicit that the condition Z — dbz 
under which A and B have been registered may be writ- 
ten in terms of a common condition z and conditions a 
and b that may have a causal effect on the outcome of A 
and B, respectively. Note that a, b and z are propositions 
too. 

According to Bell, since the events B and b can have no 
causal effect on the event A, in a local causal theory 



and, similarly. 



P{A\Babz) = P{A\az), 



P{B\abz) = P{B\bz), 



yielding 



P{AB\abz) = P{A\az)P{B\bz). 



(64) 



(65) 



(66) 



The steps that take us from Eq. to Eq. clearly 
show that Bell believes that the absence of a causal influ- 
ence implies logical independence. In fact, within prob- 
ability theory, Eq. (|66p is the formal statement that A 
[B) is logically independent of b (a) (see Eq. (jH))). 



According to Bell, theories that do not satisfy Eq. (I66p . 
such as quantum theory, are not locally causal . The- 
ories that satisfy Bell's criterion of locality, as expressed 
by Eq. (|64p . will be called B-local. We formalize this by 
introducing 

Definition: A theory is B-local if and only if Eqs. ((64)) 
and (|65p are satisfied 

or equivalently. 

Definition: A theory is B-nonlocal if and only if 
Eqs. ([M)) or (p5|) are not satisfied. 

Clearly, B-locality is defined within the realm of prob- 
abilistic theories only. Note that the folklore on the 
EPR paradox generally does not distinguish between B- 
locality and E-locality, a remarkable logical leap because 
E-locality is defined on the level of individual events 
whereas B-locality is defined in terms of probabilities for 
events to occur. 

A possible explanation for not noticing that this is a 
major logical step to take is that it is quite common to 
mix up the meaning of frequencies and probabilities. The 
former is a property that we measure by counting. It is 
a property of the whole system under study. The latter 
is a mental, mathematical construct that allows us to 
reason about the former. The reader who has difficulties 
to grasp this delicate but fundamental point may find it 
useful to read Sec. IIVCTT] once more. 

If the events A and B are represented by integer or 
real variables A and B (a minor abuse of notation), the 
expectation of the joint event AB conditional on ab is 
defined by [13, [11 



{AB)ab = ^ABP{AB\abz 



A.B 



If Eq. ([55)1 holds, we have 



EB{a,b) = {AB)^-, = {A)a{B)i. 



(67) 



(68) 



where we used the subscript B to indicate that we have 
assumed that the theory is B-local. Let us focus on the 
case that ~1 < A < \ and — 1 < B < 1. Denoting 
a = (A) a, b = {B)f^, c — {C)c, a, b, and c all lie in the 
interval [—1, 1] and we have 



hence 



\ab-ac\<\b-c\<l-bc, (69) 



lEsia, b)-EBia,c)\+ Egib, c) < 1, (70) 



which has the form of one of the Bell inequalities (other 
inequalities can be derived in exactly the same manner) 
but lacks the element of the hidden variables (see later). 

A B-local theory can never violate the inequality 
Eq. dTO]). If, we find that inequality Eq. ^ is violated 
for some E{a,b), the only conclusion that can be drawn 
is that E{a,b) cannot be obtained from a B-local proba- 
bilistic theory. 
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To appreciate the consequences of Bell's definition of 
a local theory, it is very instructive to apply it to ex- 
amples that do not require concepts of quantum theory. 
We first consider a very simple experiment that shows 
that application of Bell's definition of locality leads to 
the conclusion that an urn filled with balls of two differ- 
ent colors is described by a theory that is B-nonlocal [s^ . 
Second, we show that Bell's assumption that the absence 
of causal influence implies logical independence enforces 
very strong conditions on the functional dependence of 
the probability distributions, severely limiting the (clas- 
sical) phenomena that a B-local theory can describe. 



1. Bernouilli's urn is B-nonlocal 

Let us take an urn filled with M red and N — M white 
balls (it is sufficient to take N — 2 and M = 1 to see the 
consequences of Bell's definition of locality) [s^. A blind 
monkey, having no knowledge about the position of the 
balls in the urn, draws two balls without putting the first 
ball back into the urn. We consider the events i?i = "the 
result of the first draw is a red ball" and i?2="the result 
of the second draw is a red ball". Denoting all other 
knowledge about this experiment by Z, the probabilities 
for i?i and R2 are 



PiRi\Z) ^ PiR2\Z) 



M 



(71) 



If the result of the first draw is a red ball, the probability 
that the result of the second draw is also a red ball is 
given by 



P{R2\RiZ) = 



M-1 



(72) 



Let us now assume that the monkey hides the first ball 
from us but that it shows us the second ball, which turns 
out to be red. As there can be no causal effect of the 
second draw on the result of the first draw, application 
of Bell's reasoning to this experiment yields 

. , . M 

P{Ri\R2Z) = P(Ri\R2Z) = P{Ri\Z) = — , (73) 

which is obviously inconsistent with the basic rules of 
probability theory. Indeed, from axiom 3, we have 

P{Ri\R2Z)P{R2\Z) = P{R2\RiZ)P{Ri\Z)., (74) 



and using Eq. ([7T|) we find 

P{Ri\R2Z) ^ P{R2\RiZ) 



M - 1 
iV- 1 ' 



(75) 



which is definitely in conflict with Eq. ([75]) . Thus, Bell's 
assumption that the absence of a causal influence implies 
logical independence leads to inconsistent results in prob- 
ability theory when applied to the simple physical system 
of an urn filled with red and white balls ^36|. 



2. B-local hidden variable theories 

We now demonstrate that a consistent application of 
Bell's definition of locality imposes severe constraints 
on the functional form of the probabilities. Following 
Ref. [13, let us introduce a new set of K exhaustive, mu- 
tually exclusive events Hk {k — 1,...,K), exhaustive 
implying that Hi + . . . -I- Hk is always true. Then, ac- 
cording to the rules of probability theory [STj 



P{AB\abz) = P{AB{Hi 



K 



= ^P{ABHk\ahz) 
fc=i 

K 

= ^P{AB\Hkabz)P{Hk\abz). (76) 



k=l 



To make contact to Bell's work, we write A instead of Hk, 
call them hidden variables and replace the summation by 
an integration. We have 



P{AB\abz) = j P{AB\abzX)P{X\abz)dX. 



(77) 



The variables A may have a causal influence on the events 
in regions 1 and 2, hence they may affect the events A 
and/or B. Invoking the product rule, we find [37 1 



P{AB\abz) = J P{A\BabzX)P{B\abzX)P{X\abz)dX. 

_ (78) 
Following Bell [lj|, in his locally causal theory, Eqs. 
and (|65p hold and therefore Eq. ([75)1 simplifies to 

P{AB\abz) = J P{A\azX)P{B\bzX)P{X\abz)dX. (79) 

Note that if there was a logical dependency between the 
events A and B, we definitely destroyed it by dropping 
B in P{A\BdbzX). 

Let us now make the (physically reasonable) assump- 
tion that the events A are logically independent of d and 
6, an assumption which is also implicit in the work of 
Bell (because he ignored the difference between physical 
and logical independence). In other words, it is assumed 
that 

P{X\dbz) = P{X\dz) = P{X\bz) = P(A|z). (80) 
Then, Eq. ^ simphfies to 

P{AB\dbz) = J P{A\dzX)P{B\bzX)P{X\z)dX, (81) 

which is the expression for the joint probability 
P{AB\dbz) under the hypothesis of B-locality [l^ . 

The famous Bell inequality [H| follows from Eq. ([HTjl 
by repeating the steps that lead to Eq. ([70[) . We denote 
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the expectation value of AB by 

E§{a,b) = j^ABP{A\azX)P{B\bz\)P{\\z)d\ 



A,B 



(82) 



where the superscript ^ indicates that we compute the 
expectation using the "hidden variable" probability dis- 
tribution defined by Eq. ([STil . As before, we focus on 
the case that -1 < A < 1 and -1 < S < 1. Then 
a(A) = (i)aa, 6(A) = c(A) = {C)c,x, all lie in the 

interval [—1, 1]. We have 

ai\)b{X)P{\\z)dX - J a{\)c{\)P{X\z)d\ 
\a{X)b{X) - a{X)c{X)\P{X\z)dX 
< I {l-b{X)c{X))P{X\z)dX 
1 - I b{X)c{X)P{X\z)dX, 



< 



(83) 



hence 



\E^{a,b)-E^{a,c)\+E^{b,c)<l. 



(84) 



Logical consistency of a B-local theory demands that 
we may first apply Eqs. ([64]) and ([65]) to P{AB\dbz) and 
then insert the events A. This gives 

P{AB\dbz) = P{A\az)P{B\bz) 

P{A\azX)P{X\dz)dX 



X i^j P{B\bzX)P{X\bz)dXj . (85) 
Then, Eqs. §B) and §B yield 

P{AB\abz) = I P{A\dzX)P{B\bzX)P{X\z)dX 

P{A\azX)P{X\z)d}^ 

X ( / P{B\bzX)P{X\z)dX ) , (86) 



and we see that in order for Bell's local probabilistic 
theory to be mathematically consistent, the probabil- 
ities P{A\dz\) and P{B\bzX) should satisfy Eq. ((86)) . 
for all A, B, d, and b, and for all P{X\dbz) satisfying 
Eq. ([80]) . Furthermore P{AB\dbz) is completely deter- 
mined by P{A\azX) and P{B\bzX). Assuming, as is usu- 
ally done, that the two measuring devices are the same, 
we may write Eq. (|86p as the functional equation 



FiA,a,X)F{B,b,X)p{X)dX 



F{A,a,X)p{X)dX / F{B,b, X)p{X)dX, (87) 



where < F{A,a,X) < 1, < F{B,b,X) < 1 and < 
P(A) < 1. 

It may be of interest to note that the quantum theo- 
retical expression for the single-particle probability 



F(A,a,S) 



1 + • S 



describing a Stern-Gerlach magnet (for which A = ±1), 
does not satisfy functional equation Eq. (j87p . assuming 
Eq. ([80]) holds here too. Indeed, integrating over S over 
the unit sphere yields (1 -I- ABa • b/3) = 1 for the con- 
sistency condition Eq. ([57)) . which obviously leads to a 
nonsensical conclusion (see also the Appendix). 

Within probability theory, a mathematically consistent 
application of B-locality severely limits the form of the 
probabilities and, as in the case of the urn, leads to con- 
clusions that defy common sense, even in the realm of 
every-day experience. 



C. Reductio ad absurdum 

We now address the logic of the reasoning that was 
used by EPR and then apply the same logic to the rea- 
soning used by Bell. We emphasize that we consider the 
logic of reasoning only. For instance, whether or not 
quantum theory is a correct description of the experi- 
mental data is not the issue here. We are concerned with 
logic only. 

The argument put forward by EPR can be formalized 
as follows 

1. Q is true is equivalent to the statement that quan- 
tum theory is a correct description of the experi- 
mental data. 

2. C is true is equivalent to the statement quantum 
theory is complete. Note that the precise definition 
of "complete" is irrelevant as far as the logic of 
reasoning is concerned. 

EPR use the formalism of quantum theory to prove that 
quantum theory is incomplete. Thus, EPR show that if 
quantum theory is a correct description of the experimen- 
tal data and quantum theory is complete then quantum 
theory is incomplete. This reasoning is an example of 
reductio ad absurdum: To disprove a statement, we as- 
sume it is true and then prove that it leads to a logical 
contradiction. 

In formal language, EPR prove that 



(89) 



where A, and denote the logical "and" operation, 
logical implication, and logical negation, respectively. 
Equivalently, we can write 



(90) 
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where V denotes the logical "or" operation. From 
Eqs. ([55)) or (PO]). it is clear that if we accept that state- 
ment Q is true, statement C must be false if we do not 
want to run into a contradiction. 

We now apply the logic of the reasoning used by EPR 
to the reasoning used by Bell. First, we introduce the 
symbol E: 

3. E is true is equivalent to the statement that 
quantum theory obeys Einstein's criterion of local 
causality (the precise meaning of this criterion is 
irrelevant for the logic of reasoning). 

Bell's extension of Einstein's criterion for a locally causal 
theory to probabilistic theories can be formalized as fol- 
lows: 

4. B is true if and only if Einstein's criterion of local 
causality is equivalent to the statement that if a 
variable b has no causal effect on the variable A 
then, in a probabihstic theory, P{A\bZ) = P{A\Z) 
must hold. 

Assuming B is true. Bell derives inequalities that are 
violated by quantum theory. In formal language. Bell 
has shown that 

QAEAB^E, (91) 

which is a logical contradiction. Assuming that quantum 
theory gives a correct description of experimental data, 
Q is true. Then, from Eq. ([?T|) . if follows that 1) B is 
false or 2) E is false or 3) both B and E are false. Bell 
apparently excluded the possibility that his probabilistic 
interpretation of Einstein's criterion of local causality was 
wrong, hence he drew the conclusion that quantum the- 
ory is E-nonlocal. However, Bell's conclusion that quan- 
tum theory is E-nonlocal has been drawn on the basis 
of a logically incorrect argument: B-locality implicitly 
assumes that the absence of a causal influence implies 
logical independence [36| but, in probability theory, it 
is well-known that the assumption that the absence of 
a causal influence implies logical independence leads to 
logical inconsistencies [36, 40]. Hence, either B is false 
or the mathematical framework of probability theory is 
logically inconsistent. 

Excluding the hypothesis that probability theory is 
logically inconsistent, it follows that B is false but we 
cannot rule out that E is false also. However, Bell's gen- 
eral conclusion that an E-local, causal theory cannot be 
a candidate for a more complete theory than quantum 
theory is based on the wrong assumption that B is true. 
B-locality only looks deceptively similar to E-locality but 
is fundamentally different. 

Thus, we are left with three options: (1) We adopt 
Bell's definition of locality, keep insisting that causal in- 
denpence implies logical independence and learn to live 
with the fact that it leads to absurd conclusions such as 
an urn with two balls being "nonlocal" , (2) we change 
the rules of probability theory t67| or (3) we keep using 



probability theory as it is and reject Bell's definition of 
locality as a logically consistent extension of Einstein's 
notion of locality to the domain of probabilistic theories. 
We do not believe that option (1) is worth considering 
any further, nor that option (2) is a viable one, in par- 
ticular because quantum theory, being a very successful 
theory, requires the established mathematical apparatus 
of probability theory to make contact to experimental 
data. 



D. Alice on Earth and Bob on Mars 

For a logically local algorithm, such as the one de- 
scribed in Sec. |Vl the condition that the two observa- 
tion stations must be spatially separated is irrelevant. 
To see this, imagine the following scenario. We ask Bob 
to choose a set of directions a„^2 as he likes and we also 
ask him to keep this set secret. Then we send Bob to 
Mars. 

After Bob has arrived on Mars, we (still on Earth) 
prepare data sets {S„.i|rt = 1,...,N} and {S„_2 — 
— S„_i|n = 1, . . . , N} for Case I and send the second set 
by a radio link to an observation station 2 that is located 
on Mars. Once this data has been sent (which takes a 
few seconds at most), the link is destroyed. Then, we 
give the first data set to Alice who is in charge of station 
1 on planet Earth. She processes her data for some set 
of directions a„_i that she may choose as she likes and 
obtains the data set Ti. This also takes a few seconds. 

It takes at least five minutes before Bob, who controls 
station 2 on Mars, starts to receive the data. Bob pro- 
cesses this data, using a set of directions a„^2 he chose 
before leaving for the mission to Mars and which he kept 
secret all the time, and obtains the data set T2. Then, 
Bob activates a radio link and sends the data set T2 to Al- 
ice (or a third person). Alice analyses the data {Ti, T2} 
and computes the correlations according to the procedure 
outlined in Sec. FVland draws the unescapable conclusion 
that the data exhibit "quantum correlations" . 

If we assume that Alice and Bob never had the chance 
to communicate with each other, there is no way, other 
than by telepathy, that Bob could have infiuenced Al- 
ice's choice of a„^i. Alice, not aware of the existence 
of Bob before Bob arrived on Mars could not influence 
Bob's choice of a„^2 either. In this hypothetical proce- 
dure, at the time that the data analysis was carried out, 
the two systems were spatially and temporally separated 
and there is no physical mechanism known to man by 
which Bob could have influenced Alice's choice. 

There is no point of sending Bob to Mars: If Bob 
would have analyzed the data {S„_2} on earth, the data 
{Ti, T2} would be exactly the same and so would be the 
conclusion that the data exhibit "quantum correlations" . 
This thought experiment (which can in fact be realized) 
is just another illustration that correlations express logi- 
cal but not necessarily physical dependencies. 
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E. Summary 

In an attempt to extend Einstein's concept of a locally 
causal theory to probabilistic theories, Bell implicitly as- 
sumed that the absence of causal influence implies logical 
independence. In general, this assumption prohibits the 
consistent application of probability theory and leads to 
all kinds of paradoxes [33, ll^l • However, if we limit our 
thinking to the domain of quantum physics, the viola- 
tion of the Bell inequalities by experimental data should 
be taken as a strong signal that it is the correctness of 
this assumption that one should question. Instead of 
calling quantum mechanics (or an urn containing two 
balls) a nonlocal theory, it would be more appropriate 
to reject the assumption that the absence of causal influ- 
ence implies logical independence. This step is difficult 
to take unless one recognizes that probabities are not de- 
fined by frequencies: Much of the recent controversies 
about the correctness and/or applicability of Bell's the- 
orem [H [H m m, H Ei, SI m, m can be traced 
back to the failure of keeping apart the concept of the 
frequency of events and the concept of the probability to 
observe this frequency [13, US] ■ 

Most importantly, it is simply logically incorrect to 
use probability theory to even make a statement about 
the (non)existence of correlations in a set of experimen- 
tal data. At most, we can conclude that a probabilistic 
model is compatible with the data, in which case we made 
a significant step in describing the process that gave rise 
to the data. 

The simulation models that we describe in this paper 
do not rely on concepts of probability theory: They are 
purely ontological models of the EPRB experiment. The 
expression for the coincidence Eq. ^ cannot be written 
in terms of a product of two single-particle probabilities, 
an essential feature of the restricted class of local mod- 
els examined by Bell Hence, the fact that we have 

I 



discovered event-by-event simulation algorithms that (1) 
generate the same type of data as recorded in the exper- 
iments, (2) analyze data according to the experimental 
procedure to count coincidences, (3) satisfy Einstein's cri- 
teria of local causality, (4) do not rely on any concept of 
quantum theory or probability theory, but nevertheless 
reproduce the two-particle correlation of the singlet state 
and all other properties of a quantum system consisting 
of two 5 = 1/2 particles can never be in conflict with a 
theorem that has its roots in probability theory. 



VII. PROBABILISTIC MODEL OF THE 
SIMULATION ALGORITHM 

In this section, we use the probabilistic (Kolmogorov) 
approach to analyze the simulation model that we de- 
scribed in Section |Vl This section serves three purposes. 
First, it provides a rigorous proof that for to first order in 
W, the probabilistic description of the simulation model 
can exactly reproduce the results of quantum theory for a 
system of two 5* = 1/2 objects. Second, it illustrates how 
the presence of the time-window introduces correlations 
that cannot be described by a Bell- like "hidden- variable" 
model. Third, it reveals a few hidden assumptions that 
are implicit in the derivation of the specific, factorized 
form of the two-particle correlation that is essential to 
Bell's work. 

The first, fundamental step is to assume that the sim- 
ulation algorithm can be replaced by an abstract math- 
ematical model in which the quadruple {xi, X2,ti,t2} is 
a random variable and that the data occurs with prob- 
ability P{xi, X2,ti,t2\sii, SL2)- We then use the standard 
rules of probability theory to write this probability such 
that it can be evaluated analytically. 



Using the product rule (see Eg.f [76]) ). we may always express the probability for observing the data {xi, X2,ti,t2} 
as a sum over the mutual exclusive events. Thus, we may write 



P{xi,X2,ti,t2\ai,a2) = ^^^2)2 J d'^i^^2P{xi,X2,ti,t2\ai,a2,Si,S2)P{Si,S2\ai,a2), 



(92) 



where Si and S2 denote the three-dimensional unit vector representing the spin of the particles. Representation 
Eq. (|92p is an exact expression for P{xi^X2,ti,t2\ai,a2). In the simulation model, 0:2, ti, ^2} are mutually 
independent and {xi,ti} ({x2, 12}) do not depend on {a2, S2} ({ai. Si}). This suggests that it is reasonable to assume 
that {xi, 2:2, ii, ^2} are mutually independent random variables and that {xi,ti} ({2:2,^2}) are logically independent 
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of {32,82} ({ai,Si}). Then, we have 

P(xi,X2,ti,t2|ai,a2) = J^~Tj2 J ^Si^S2f'(a;i,ti|a;2,t2,ai,a2,Si,S2)P(a;2,i2|ai,a2,Si,S2)P(Si,S2|ai,a2) 

rfSirfS2P(xi,ti|ai,Si)P(a:2,t2|a2,S2)F(Si,S2|ai,a2) 
dSi dS2 P(xi|ai, Si)P(ii|ai, Si)P(x2|a2, S2)P(<2|a2, S2)P(Si, Ssjai, aa) 
dSi dS2 P(xi|ai, Si)P(ii|ai, Si)P(x2|a2, S2)P(t2|a2, S2)P(Si, S2), (93) 



(47r2)2 
1 

(4^ 
1 

(4^ 
1 

where, in the last step, we assumed that Si and S2 are logicahy independent of ai and a2, which is reasonable because 
in the simulation algorithm Si and S2 are independent of ai and a2. Note that Eq. ((93)) gives the exact probabilistic 
description of our simulation model. 

The reader may wonder why in the present case it is allowed to go from Eq. (l92|) to Eq. (I93|) while in Section IVll 
we demonstrated that making these steps may lead to logical inconsistencies. The difference is in the fact that in 
Section IVIl we use probability theory to make inferences about logical dependencies whereas in the present case we 
know for certain (by assumption) which variables are logically dependent on others and which variables are not. Thus, 
in the present case it is mathematically correct to describe our simulation model by the probability Eq. (|93|) . However, 
if we analyze data for logical dependencies, it is logically inconsistent to draw conclusions from an analysis based on 
Eq. ([93)1 . In essence, we are repeating ourselves: We can cross the line in Fig. [U separating model space from data 
space from right to left because we know the properties of our simulation model but crossing the line in the opposite 
direction is impossible without making additional assumptions. 

Up to this point, Eq. has the same structure as the expression that is used in the derivation of Bell's results and 
if we would go ahead in the same way, our model also cannot produce the correlation of the singlet state. However, 
the real factual situation in the experiment is different: The events are selected using a time window W that the 
experimenters try to make as small as possible [H^. Accounting for the time window, that is multiplying Eq. ([M]) by 
the step function, and integrating over all ti and ^2, the expression for the probability for observing the event {xi, X2) 
reads 

^, . . /dSidS2P(a:i|ai,Si)P(.T2|a2,S2)u;(ai,a2,Si,S2,Ty)P(Si,S2) 
P(a:i,a;2|ai,a2) - 



E.i.x,=±i / dSi dS2 P(a;i|ai, Si)P(a;2|a2, S2)«;(ai , a2, Si, S2, M/)P(Si, S2) 
/ rfSi dS2 P(xi|ai, Si)P(a;2|a2, S2)w(ai, a2. Si, S2, T4^)P(Si, S2) 



(94) 



/ dSi dS2 u;(ai, a2. Si, S2, M^)P(Si, S2) 
where, in general, the weight function 

/+00 ;'+cxD 
dh dt2P(ii|ai,Si)P(t2|a2,S2)e(Ty-|ii-t2|), (95) 
-00 —00 

will be less than one (because Jl!^^ dti dt2 P{ti\ai,S)P{t2\a2,S) = 1) unless W is larger than the range of 
{ti,t2) for which P(ti|ai, Si) and P(t2|a2, S2) are nonzero. Unless z«(ai, a2, Si, S2, W) = wi{ai, Si, VF)u'2(a2, S2, W), 
Eq. cannot be written in the factorized form P{xi,X2\a, /3) = J P(xi |a, A)P(a;2 1/3, A)/9(A)(iA that is essential to 
derive the Bell inequalities (see Section |VI| . 

In the light of the discussion in Sections [T] and IIV CI it is not without importance to note that Eq. (|M|) can be 
written down directly (as we did in Section IV B 3p , without reference to concepts of probability theory. Indeed, it 
suffices to replace the sums over the pseudo-random numbers by discrete sums over equally spaced intervals and let 
these intervals go to zero. Then the total number of events goes to infinity and we recover Eq. (|94p . except that the 
P's that appear in Eq. (|94p do not have the meaning of probabilities. Again, we see that the use of probabilistic 
models requires additional assumptions, the correctness of which can be established a-posteriori only. 

First, let us consider Case II, that is we assume that the source emits pairs of particles with fixed, known directions 
Si and ^2. Then, P(Si, S2) = S{Si — 5i)^(S2 — ^2), the weight function ix;(ai,a2,5i,52, W) drops out and Eg. fM]) 
reduces to 

P(xi,a;2|ai,a2) = P(a;i|ai, 5i)P(x2|a2, ^2), (96) 



which agrees with the expression for the quantum system of two S* = 1/2 particles in the product state. 
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Second, we put P(Si,S2) = 5{Si + 82)- Then, Si = — S2 is a random variable that covers the unit sphere in a 
uniform manner, that is we are treating Case I. In our simulation model, the time delays ti are distributed uniformly 
over the interval [0, TJ where T, = Toll - (a; • S,)^\'^^^ for i ^ 1,2. Thus, P{ti\sLi, Si) = eiti)e(T, - U)/T, and 

u;(ai,a2,Si,S2,W,d) = [ ' dh [ ' dt2 e{W - \h - t2\), (97) 

-L 1-^2 Jo Jo 

where we added the parameter d to the list of variables to make explicit that we adopted the time-tag model that we 
employ in the simulation. The integrals in Eq.((97|) can be worked out analytically, yielding 

Zi;(ai,a2, Si, S2, W,d) = — [ + T| + 2{Ti + T2)W + {W ~ T^)\W - Ti| + (M^ - T2)\W - T2I 

4J li2 

-{W - Ti + T2)\W -Ti+ T2I -{W + Ti- T2)\W + Ti - T2I ]. (98) 

Clearly, Eq. ([98| cannot be written in the factorized form wi{a.i, Si, M^)u'2(a2, S2, W). Hence, it should not come as 
a surprise that as soon as we want to model the real experiment in which the time window is essential, we may obtain 
correlations that cannot be described by Bell-like models. 

We now consider the relevant limiting cases for which we can easily derive closed-form expressions for the expectation 
values. From Eq. (IM)) . it follows that 

z«(ai,a2,Si,S2,T4^^«3,d) = ii;(ai, a2. Si, S2, T4^ > Tq, rf) = 1, (99) 
«;(ai,a2,Si,S2,W^<ro,d = 0) = Wi2W-To)/Tl (100) 

2W 

z(;(ai,a2,Si,S2,T4^^0,rf) = ^7r^ + 0(W^')- (101) 

max(Ji, i2) 

If the weight function is a constant, as in Eqs. (|99p and (jlOOp . Eq. ([M|) reduces to 

P(xi,a;2|ai,a2) = / dSi dS2 P(x-i|ai, Si)P(a;2|a2, S2)P(Si, S2) ; d = or IF > Tq, (102) 



and takes the factorized form that is characteristic for the probabilistic models considered by Bell [13] ■ Hence, we 
know that we cannot recover the results of quantum theory in the limiting cases d = or 11^ > Tq in which the 
time-tag information plays no role. 

From now on, we focus on the experimentally relevant case of small 11^, that is we neglect contributions of 0{W'^). 
We insert in Eq. ([94|) . the probability distributions P(a;|a, S) = 9(a;a • S) or P(x|a, S) = (l-f xa • S)/2, corresponding 
to the deterministic and pseudo-random model for the Stern-Gerlach magnet respectively. By symmetry we have 
El (ai , a2 , ly ^ 0) = E2 (ai , a2 , 11^ ^ 0) = for all values of d, in agreement with quantum theory (see the second 
column of Table The two-particle correlations are given by 

f dS sign(ai • S)sign(a2 • S) max-i(ri, r2) 
E{Bii,a2,W ^ 0) = ^ ^ J n y ^ ) V 1, 



/dSmax-i(ri,r2) 
and 

r dS ai • S ao • S max^^fTi , To) 

i?(ai,a2,VK^0) = -J \ 103 

J dS max ^(Ji, J2) 

for the deterministic and the pseudo-random model, respectively. 

Without loss of generality, we may choose the coordinate system such that ai — (1,0,0) and a2 ~ (cos a, sin a, 0). 
Then, substitution of T, = \S^ x a, I'' into Eq. (fT03l) yields 



f dx C/'+'Z' d(l)g{(l>, 9, x)(sin2 cj) + cos^ ^)-'^/2 
i?(ai,a2,iy-0) = :Y 7 ' (104) 

Yl dx ^0 2 ' (^in^ + x2 C0S2 0)-'i/2 



where 5(0, 0, x) = sign(cos(/)cos(0 — 0)) or g{(j),6,x) — {1 ~ x"^) cos 4> cos{4> — 9) for the deterministic or pseudo-random 
model for the Stern-Gerlach magnet, respectively. Here and in the remainder of this section, we define cosfl = ai • a2. 
For specific values of d, Eq. (I104p can be written in terms of elementary functions. In the case of the deterministic 
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model for the Stern-Gerlach magnet, we find 



-1 H — arccos(cos 9) 

TT 



COS 9 

15 cos 6* - 7cos^6' 



ll-3cos2 6i 

6890 cos 9 - 895 cos 36* + 149 cos 56* 
5774 + 280 cos 261 + 90 cos 46* 



,d = 
,d = 3 

,fi = 5 
,d = 7 



(105) 



In the case of the pseudo-random model for the Stern-Gerlach magnet, we obtain 



S(ai,a2,W'^0) 



- i cos 9 
8cos6i 



- 3 sin 9 
2992 cos ( 



80 cos 36* 



2887 + 140 cos 261 + 45 cos 46* 

84026 cos 61 + 8169 cos 30 - 35 cos 561 
45666 + 16254 cos 29 - 1680 cos^ 6* 3150 cos^ 9 



,d = 
,d = 5 
,d=7 
.d = 9 



(106) 



All the d > results in Eqs. (|105p and (|106[) violate the Bell inequalities but, as we already explained, this finding 
has no significant consequences. From Eq. (|105p . we conclude that for d = 3 and the deterministic model of the 
Stern-Gerlach magnet, the expression is identical to the correlation of a system of two S* = 1/2 particles in the singlet 
state. The result for the pseudo-random model of the Stern-Gerlach magnet and d = 7 (see Eq. (|106p ) is very close 
(with a maximum error of less that 2%) to the singlet correlation. Of course, there is no fundamental reason why 
d should be an integer. Finally, we note the almost trivial fact that for W ^ 0, the results are insensitive to small 
variations in W, in agreement with the general idea, explored in the next section, that quantum theory is one out of 
the many probabilistic theories that has the special feature that its predictions are insensitive to small changes of the 
parameters of the model. 

For completeness, we list the analytical results for the case of the photon polarization. For the deterministic model 
of a polarizer (which does not reproduce Malus law) , the probabilistic treatment yields [33t 



- 1 H — arccos(cos 29) ,d 

TT 



In 


1+ 


cos 


?l 1-1 


sin 6 


1- 


COS 


?l 1+ 


sin 6*1 


In 


1+ 


COS 


n 1+1 


sin^l 


1- 


COS 


5| 1-1 


sin 6 



-cos 26* 

3cos26'- cos^ 26* 



,d 
,d 



(107) 



In the case that we adopt the pseudo-random model for the polarizer that can reproduce Malus law, the probabilistic 
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model yields [3£ 



£;(ai,a2,W^-^0) = < 



--cos 26* 
2 



- sin 261 cos 20 - cos 29 + ln[| tan 9 
4 

- cos 29 

43 cos 26* + 5 cos 26* cos 46* 



sin^ 2e/2l 



38- 
53 cos 26* 



10 cos 461 
- 7 cos 69 



39 + 21 cos 46* 



,d = 



= 6 



(108) 



where we have omitted the expressions for odd d because they cannot be written in terms of elementary functions. In 
passing, we note that the mathematically rigorous result for d = 4 disposes of the widespread believe that perfect 
correlation of the singlet state requires some form of determinism. 



VIII. DERIVATION OF THE QUANTUM 
THEORY OF THE EPRB EXPERIMENT 

In the quantum theoretical model, the choice of the 
state that describes the EPRB experiment is an educated 
guess. There is no underlying principle that guides us 
to this choice other than that the particular averages (of 
time series) that we compute from the experimental data 
agree with the expectation values (ensemble averages) 
obtained from the theory. 

From the work of Cox and Jaynes in the early 60 's, 
we know that once we have agreed to represent the de- 
gree of the plausibility of a proposition by a real number, 
then there is a unique set of rules, identical to the stan- 
dard rules of probability theory, that we must adhere 
to in order that the logical inferences we make do not 
violate elementary desiderata of rationality and consis- 
tency [13, SOj 113 ■ I'^ this case, the rules of probabihty 
theory are used as a vehicle for carrying out probable 
inference [s^, SO, S^l and have a much broader range of 
applications than the Kolmogorov theory of probability. 
The latter is incorporated in the former. As mentioned 
earlier, and as is most evident in Section IVIl we mainly 
use probability theory as a vehicle to make statements 
about propositions, that is we use it in its extended logic 
mode. 

An intriguing question now arises: Would it be possi- 
ble to derive the quantum theoretical description of the 
EPRB experiment from the general principles of logical 
inference and empirical knowledge about the results of 
the experiment, not involving concepts from quantum 
theory at all? Elsewhere, we have shown that Malus law 
can be derived in this manner [6^. In this section, we 
show that the same approach yields the probability dis- 
tributions Eqs. ((20l) and (|25p without making the detour 
via quantum theory. 

The approach that we take here is very much inspired 
by the work of Frieden 70] . Frieden has shown that one 



can recover all the fundamental equations of physics by 
finding the extremes of the Fisher information plus the 
"bound" information [tO] . According to Frieden, the act 
of measurement elicits a physical law and quantum me- 
chanics appears as the result of what Frieden calls "a 
smart measuremeiit" , a measurement that tries to make 
the best estimate [701 • Although our approach is similar 
to Frieden's, our line of reasoning is different. We do not 
invoke concepts from estimation theory, such as the esti- 
mators and the Cramer- Rao inequality, nor do we require 
the concept of random noise. 

The probabilistic model that we will develop is based 
on the following four hypotheses: 

1. Each detection event constitutes a Bernouilli trial, 
that is we assume that the events are logically in- 
dependent [13, m, Note that the absence 
of statistical correlation in the data recorded in 
an experiment is an indication but definitely not 
a proof that the events are logically indepen- 
dent [13, H El- On the other hand, if the data 
would exhibit correlations, the events would be log- 
ically dependent [13, H 13 . 

2. The time series recorded during an experiment sug- 
gest that the averages of the data are rotational 
invariant. This observation we formalize by mak- 
ing the hypothesis that the expectation values (not 
necessarily the probabilities) are invariant for rota- 
tions of the conditions under which the experiments 
are carried out. 

3. The model operates according to the principle of ef- 
ficient data processing 66'] : It generates the events 
such that the probability distribution is least sen- 
sitive to small variations in the conditions under 
which the experiment is carried out. In other 
words, the probability distribution should be as 
smooth as possible, for all values of the parame- 
ters that determine these conditions. 
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4. The time series that we observe is the one which is 
most likely to be observed, that is its probability is 
maximum. 

In the remainder of this section, we will simplify the nota- 
tion a little by omitting from the conditions that appear 
in the probabilities, the proposition that expresses the 
knowledge about the problem that we do not need to 
specify explicitly. 

We begin by demonstrating that these four assump- 
tions suffice to derive the probability distribution 



P{x\eL,S) 



1 + a:;a • S 



of a single Stern-Gerlach magnet. Then, using the same 
four assumptions, we derive the probability distribution 



P{x,y\ai,ii2) = 



1 - xysLi ■ a2 



(110) 



(109) for the EPRB experiment. 



J 



A. Stern-Gerlach magnet 



We consider the case that the direction a of the applied field in the Stern-Gerlach magnet and the magnetic moment 
S of the particles do not change with time. The measuring apparatus (Stern-Gerlach magnet -I- particle detector) 
transforms the input, N particles with magnetic moment S, into a time series {x„|n = 1, . ■ . , N} of signals x„ = ±1. 
By hypothesis (1), the probability P{xi, . . . , XArja, S, N) to observe the data record {a;„|n = 1, . . . , N} can be written 
as P(x„|a, S). As a; = ±1, P{x\a, S) is completely determined by its first moment, that is we can write 

PWa,S).i±ifi^. (Ill) 

where i?(a, S) = a;P(a;|a, S). By hypothesis (2), £'(a, S) = i?(a • S) and hence the probability for a single 

event x is given by 

P{x\a,S) ^ P{x\a-S) = P{x\d), (112) 

and is conditional on the relative angle 6 between the magnetic moment S of the particle and the direction a of the 
applied field. Denoting p{6) = P{x = -l-l|6'), the probability for observing a time series {x„|n = 1, . . . , N} in which 
TO of the events Xn take the value -|-1, that is Y^^^i ^" ~ ~ given by [13, [H, H^l 

P{m\e,N) = p"'(^)[l-p(g)]^— . (113) 

m\(i\ — my. 

We now consider the likelihood that the observed sequence of {xn} was generated by p{x\9 -t- e) instead oip{x\9), e 
being a small positive number. The log-hkclihood L that the data was generated by p{x\6 + e) instead of by p{x\9) is 
given by [13, H 

N N P{m\0,N) N p{9) ^ n' I - p{0) ^ ' 

According to hypothesis (3), the variation of L with e should be minimal. Then, the results (averages over the 
time-series) will be least sensitive to small variations of the conditions under which the experiment is carried out. 

We bring the problem of determining the function p{9) in a mathematically trackable form by using the Taylor 
expansion with respect to e. We find 

L _ {p'{9)f ^^m ^^^^/^ p'{9) {I ~ 2p{9)){p' {9)f + {I - p{9))p" {9) \ ^ ^^^^^ 



N 2 {1 - p{9))p{9) ' N " \ {l~p{9))p{9) 2 {I - p{9))^p^{9) 

Invoking hypothesis (4), to is the value that maximizes P{xi, . . . , XArja, S, N). A simple calculation (see Section lTV C 1|) 
shows that 



(116) 
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Hence, for large N we may set m/N = p{9) in Eq. (jllSp and then the second term of the right hand side vanishes. 
Then, L will be least sensitive to changes in e if 



If = 



1 



Pi9)il-pi0)) \ 09 J ' 
is minimal. The quantity Ip is the Fisher information [s^, [zO, [zH 

1 fdP{xi,...,XN\e,Ny^ 



If 



1 

N 



0^1, ...^=±1 ^(^1'- 



,xn\0,N) V 



de 



E 



1 /ap(a^ 



(117) 



(118) 



for this particular problem. Hypothesis (1) was used to obtain the right hand side of Eq. (|118p . which upon substitution 
of p(a; = +1|6') = p{e) and p(a:: = -116*) = 1 - p{9) turns into Eq. pT7|) . 

We find the minimum of the Fisher information Ip by substituting p{9) — cos^ g{9) and obtain 



If 



89 



(119) 



Rotational invariance requires that Ip is independent of 9, hence g{9) = a9 + b, where a and b are constants still 
to be determined. Rotational invariance further requires that p{9) = cos^ {a9 + b) = p{9 + 27r), hence a = k/2 and 
Ip ~ k^, with k an integer number. We may exclude the case k = because then p{9) does not depend on 9 and a 
Stern-Gerlach magnet that operates according to this fc = model would not be a useful device. Thus, Ip is minimal 
if fc = 1 and we may set the irrelevant phase factor b to zero. Therefore, using the four hypotheses given earlier, we 
have found that the probabilistic model for the Stern-Gerlach magnet generates events with probabilities 



P(x = +l|a,S) 



1 + a- S 



cos - = 
2 



P{x = -l|a,S) = sin^ - = 



1 -aS 



(120) 



which is in exact agreement with Eq. p09p . 



B. EPRB gedanken experiment 

We consider the case that the directions ai and a2 of the applied fields in the Stern-Gerlach magnets do not change 
with time (as in the quantum model) and that we have no knowledge about the direction of the magnetic moments 
Si and S2 of the particles. 

The measuring equipment (Stern-Gerlach magnets + particle detectors -I- time-coincidence logic) transforms the 
input, N pairs of particles with unknown magnetic moments into a time series {x„,y„|ri = l,...,Af} of signals 
Xn = ±1 and Un = ±1. By hypothesis (1), the probability ^((xi, yi), . . . , (a;7v, yA')|ai, a2, iV) to observe the data 
{xn, y7i\n = 1, • ■ • , N} can be written as Y[n Pi^n, J/ri|ai, a2). As x,y ~ ±1, P{x, ?/|ai, a2) can be written as 

I ^ l + xEi{ai,a2)+yE2iai,a2)+xyE{ai,a2) 
P{x,y\ai,a2) ^ , (121) 

where i;i(ai,a2) = ^^^,^=±1 2/1*^1' ^2), £^2(ai,a2) = X)a;,y=±i y-P(^' 2^1^!' ^2), and £'(ai,a2) = 

i/=±i ^2)- Using the empirical (experimental) knowledge that the averages are rotational invariant 

(hypothesis (2)), we have i?i(ai,a2) = i?i(ai •a2), i?2(ai,a2) — i?2(ai •a2), and i?(ai,a2) — E{ai •a2). Furthermore, 
experiments indicate that frequencies of the x = ±1 {y = ±1) events (not of the correlated events!) are the same. We 
formalize this knowledge by the hypothesis that 

Si(ai -as) = £;2(ai •a2) = 0, (122) 

from which it immediately follows that 

p/ I ^ l + xyE{ai -az) _ 1 + xyE{9) 

P(a;,?/|ai,a2) = =p{x,y\9)^ . (123) 
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Thus, the probabiHty p(x,y\0) for a single-event {x,y) is conditional on the relative angle between the two unit 
vectors ai and a2. 

The probability that an experiment of N events yields n(x, y) events of the type (x, y) is given by 

'7 ' „ , (124) 

,,,=±1 riix, yy. 

where n{+l, +1) + n(— 1, —1) + n{+l, —1) + n{—l, +1) = N. Adopting the same strategy as in the case of the single 
Stern-Gerlach magnet, we consider the log- likelihood 

L_ ^ 1 ^^ F(n(+l,+l),ri(-l,-l),n(+l,-l),7i(-l,+l)|g+e,jV) _ ^ nix,y) ^^ p{x,y\0 + e) 

N N P(n(+l,+l),n(-l,-l),n(+l,-l),n(-l,-M)|6',^) N pix,y\0) ' ^ ' 

that the data was generated by p{x,y\0 + e) instead of p{x,y\0). Repeating the steps that lead from Eq. (|114p to 
Eq. pi7p . we find that for small e minimization of L is tantamount to finding the probability that minimizes 

the Fisher information 

1 f dp{x,y\0) Y 



^^^^^pix,y\0) 

Using Eq. p23p . we can write Eq. (|126p as 



I 1 fdEi0)y 

which, in essence, is the same expression as the one that we obtained for the case of the Stern-Gerlach magnet. Of 
course, the solution of the minimization problem is also the same. Solving Eq. (jl27p for E{0), we find 



E{0) =iiin{0^/lF + b). (128) 

In the case that one uses the magnetic moment of the particles, the experimental data indicates that E{9) is periodic 
in 9 with a period of 27r (tt if the experiment measures the polarization, as in EPRB experiments with photons) . This 
implies that Ip should be an integer number. The solution Ip = Q can be discarded because then E(0) would not 
depend on 0, which would contradict the experimental observations. Therefore, the nontrivial solution with minimum 
Fisher information is Ip = 1- Using the fact that the solution of the minimization problem is determined up to an 
arbitrary phase 5, the two-particle correlation can be written as 

£;(ai,a2) = -cos6l = -ai -aa, (129) 

in agreement with the expression of the correlation of two = 1/2 particles in the singlet state. Thus, we may 
conclude that we can derive the results of quantum theory for the singlet state from a straightforward application of 
probability theory, without making reference to concepts of quantum theory. 



C. Real EPRB experiment 

As explained in Section |TT1 real EPRB experiments produce the data sets 

= {Xn,t = ±1, tn.i,SLn,iW = 1, ■ • • , iV} • (130) 

We assume that this data set can be described by a probabilistic model that satisfies hypothesis (1). Let 
P(a;i, a;2, ^1, i2|ai, 8-2) denote the probability for observing the data {a;i,ti} and {3:2, at station 1 and 2, re- 
spectively. Without loss of generality, we can use the exact representation 

, ,| ^ fo{ti,t2\a.i,a.2) + Xifi{ti,t2\&l,a.2) + X2f2{tl,t2\B.i,A2) + XiX2f'i{ti,t2\a.i,A2) 
P{Xi,X2,ti,t2\&i,a.2) = ^ , (131) 

to express the single- and two-particle correlations in terms of the functions /^(ii, i2|a-ii ^2) for z = 0, . . . , 3. Because 
< P(a;i, a;2, ii, i2|a-i, 8-2) ^ 1, the functions /^(ti, i2|a-i, 8-2) must satisfy the inequalities < /o(ii, i2|a-i, 8-2) i 
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/i(ti,f2|ai,a2) + /2(ii,^2|ai,a2) ± /3(ii, t2 |ai, a2) < 4 and < /o(<i, t2|ai, a2) ± i2|ai, a2) - /2(ii, *2|ai, a2) T 

/3(^i) i2|ai, a2) < 4. The mathematical expectation of the coincidences C^y (see Eq. ([3])), that is the average computed 
with P(a;i,a;2,ti,t2|ai,a2), is given by 

{Cxy) = N dh I dt2P(x,y,ii,i2|ai,a2)e(W^-|ii-t2|). (132) 

We find 

E{a^,a2,W) = ^ — — - -. (133) 

l^x,y=±i'\'-'xv) dti dt2Q{W - \ti - t2\)fa(ti,t2\ai,a2) 

At this point, we feel that we lack the necessary mathematical tools for carrying out the procedure that we successfully 
applied to the simpler cases treated earlier. First, it is difficult to see how the empirical knowledge that single-particle 
averages are zero and that the two-particle average is rotational invariant leads to useful conditions on the form of 
the /i(ii, i2|ai, a2). Second, the presence in Eq. (jl33p of the step functions introduces nontrivial correlations and 
prevents us from making further progress in the mathematical treatment of this problem. Third, the description now 
contains a new parameter (W to which we should also apply hypothesis (3)) as well as extra variables (<i and ^2)- 
We leave the problem of the analytical treatment of the general case for future research. 
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D. Summary 

The assumption that there is an underlying probabilis- 
tic process that gives rise to the observation of the data 
as obtained in Stern-Gerlach and EPRB experiments, to- 
gether with the very simple, plausible hypotheses (l)-(4) 
are sufficient to derive the probability distributions of 
quantum theory for the EPRB experiment, without us- 
ing a single concept of quantum theory. In addition, this 
derivation suggests that quantum theory is the proba- 
bilistic model for the set of data that is most likely to be 
observed. 

From a more general perspective, this section demon- 
strates, by way of a successful application to specific 
problems, how to formalize the process of inductive in- 
ference and derive useful results (those of quantum the- 
ory) from it. This derivation builds on prior, empirical 
knowledge that we have acquired through experiments, 
the application of probability theory as mathematical ve- 
hicle for rational reasoning, and the metaphysical prin- 
ciple that we, human observers, have great difficulties to 
interpret experimental data that is not robust with re- 
spect to small changes in the conditions under which the 
experiments are carried out |72l. 

IX. CONCLUSION 

Starting from nothing more than the observation 
that an EPRB experiment produces pairs of triples of 
data {Ti, T2}, we have constructed computer simulation 



models that reproduce the results of all single-particle 
and two-particle correlations of a quantum system of two 
5 = 1/2 particles. Salient features of these models are 
that they 

• Generate, event-by-event, the same kind of data set 
{Ti, T2} as the one recorded in real EPRB exper- 
iments 

• Satisfy Einstein's criteria of local causality 

• Count all events in which systems of two par- 
ticles have been detected, using the same time- 
coincidence criterion as used in real EPRB experi- 
ments 

• Provide a simple, rational and realistic picture of a 
mechanism that yields the correlations of an "en- 
tangled state" 

• Do not rely on any concept of quantum theory or 
probability theory 

A key ingredient of these models, not present in the 
textbook treatments of the EPRB gedanken experiment, 
is the time window W that is used to detect coincidences. 
We have demonstrated (see Section III Gp the importance 
of the choice of the time window by analyzing a data set 
of a real EPRB experiment with photons i33] . 

The mathematical treatment of the models yields re- 
sults that are in exact agreement with quantum theory. 
The condition under which an EPRB experiment yields 
results that agree with quantum theory is evident: The 
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resolution r of the devices that generate the time-tags 
and the time window W should be much smaller than the 
time delays, the range which is determined by Tq. Dis- 
regarding the timing data yields a result that disagrees 
with quantum theory and with experiment. The EPR 
paradox reappears when the experiments are analyzed in 
terms of an incomplete set of data. 

We have demonstrated that the event-by-event simu- 
lation of EPRB experiments allows us to reproduce not 
only the results of quantum theory but also allows us to 
consider cases that are not described by quantum theory. 
Therefore, for this type of experiments, the two-particle 
"world" that we can simulate contains the two-particle 
"world" described by quantum theory as a special case. 

As our work shows that it is possible to construct 
event-based simulation models that satisfy Einstein's 
criteria of local causality and reproduce the expecta- 
tion values calculated by quantum theory it opens new 
routes to ont olog ical descriptions of microscopic phenom- 
ena H, i, HH, MM m Hi il. 

We have resolved the apparent conflict between the 
fact that there exist event-based simulation models that 
satisfy Einstein's criteria of local causality and reproduce 
the results of the quantum theory of two 5 = 1/2 par- 
ticles and the folklore about Bell's theorem, stating that 
such models are not supposed to exist. The origin of 
this conflict has been traced back to Bell's extension of 
Einstein's concept of locality to the domain of proba- 
bilistic theories, the fundamental assumption being that 
the absence of a causal influence implies logical indepen- 
dence [36]. This leaves two options: 

• One accepts the assumption that the absence of a 
causal influence implies logical independence and 
lives with the logical paradoxes that this assump- 
tion creates 



Finally, we have demonstrated that it is possible to 
derive, without resorting to concepts of quantum theory, 
the quantum theoretical description of the EPRB exper- 
iment from the general principles of lo gica l inference, de- 
veloped by Cox and Jaynes, [13, SO, 113 and empirical 
knowledge about the results of the experiment. 

The computer models we have invented can be built 
with macroscopic, say mechanical parts (in principle a 
digital computer can be built from mechanical parts) . To 
the experimenter who has no knowledge of what is going 
on inside the building where the mechanical machine is 
operating, there is no way of telling whether the data 
he/she receives is generated by a quantum system or not. 
In a sense, this supports Bohr's point of view that "There 
is no quantum world. There is only an abstract quantum 
theoretical description" [tsI- 

Appendix 

For the singlet state, the probability P{x, y\a, b) reads 

1 — xya. ■ b 



P{x,y\a,h) = 



(134) 



where x,y = ±1, a and b are unit vectors. Let us now 
try to write Eq. (|134p in the form 

Fix, y\a, h)= j F{x, a, X)F{y, b, X)dX, (135) 

where A denotes a set of auxiliary variables that may be 
chosen at will. 

A simple solution to this problem is given by 

P{x,y\a,h)= / dS, (136) 



• One recognizes that logical independence and the 
absence of a causal influence are different con- 
cepts [13, HO, and one searches for rational ex- 
planations of experimental facts that are logically 
consistent, as we did in this paper 



where S — (sin cos sin6' sin0, cos 0) and the inte- 
gral is over the unit sphere. In this case, the function 
F{x, a, S) = (14- V^xa ■ S) /2 can take negative values 
and therefore it does not qualify as a probability distri- 
bution. 
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